(57) Abstract: The present invention provides polymorphisms of ABCG2 polypeptide and of the polynucleotide coding therefor, which
are related to the intracellular accumulation of indolocarbazole compounds, as well as methods for detecting the polymorphisms,
comprising collecting a sample from mammals, and determining a polymorphism of the nucleotide sequence of ABCG2 gene or a
polymorphism of the amino acid sequence of ABCG2 polypeptide. In a preferred embodiment of the present invention, the
polymorphism of the nucleotide sequence is one or more of single nucleotide polymorphisms at positions selected from the group
consisting of 34, 376 and 421 of SEQ ID NO:1, and the polymorphism of the amino acid sequence is one or more of amino acid
polymorphisms at positions selected from the group consisting of 12, 126 and 141 of SEQ ID NO:2.



Patent provided by Sughrue Mion, PLLC - http://www.sughrue.com 



WO 03/107249 



PCT/JP03/07534 



DESCRIPTION 

METHOD FOR PREDICTING A DRUG TRANSPORT CAPABILITY BY ABCG2 POLYMORPHISMS

Technical Field 

The present invention relates to a polypeptide which excretes drugs
such as cancer chemotherapeutic agents from a cell and to a gene coding
therefor. More specifically, the present invention relates to a method for
predicting a drug transport capability of a mammalian cell by determining a
single nucleotide polymorphism(s) of ABCG2 gene and/or an amino acid
polymorphism(s) of ABCG2 polypeptide and also to a polynucleotide,
polypeptide, kit, and the like used for the method.

Background Art

Predicting sensitivity to cancer chemotherapeutic drugs has long been a
challenge in conventional cancer chemotherapy. The anti-tumor activity of a
chemotherapeutic drug differs greatly depending on the type of cancer cells
and the physical traits of each patient. A chemotherapeutic drug is highly
effective for some patients, while resistance to the drug is observed in
other patients. In addition, although tumors are sensitive to
chemotherapeutic drugs in early stages, they exhibit multidrug resistance
afterward. With the conventional methods, however, it is very difficult to
judge whether a chemotherapeutic drug will be effective for a specific
patient.

A major cause of the difference in sensitivity to chemotherapeutic drugs
is a difference in intracellular drug concentration due to a difference in
drug-excreting capability. In those cancer cells, each of the transporters which
excrete the chemotherapeutic drugs out of the cell is a member of the ABC
transporter superfamily (ATP-binding cassette transporter superfamily), a
group of molecules which are localized in the cell membrane and transport their
substrates utilizing an energy source such as ATP hydrolysis.

As representative examples of the transporter, there have been reported
P-glycoprotein (hereinafter, referred to as "P-gp") encoded by the MDR1 gene and
multidrug resistance-related proteins (hereinafter, referred to as "MRP") encoded
by MRP subfamily genes such as MRP1, MRP2 and MRP3. P-gp is a
molecular pump which was already known to be involved in multidrug resistance
in multiple types of tumor, while MRP is a transporter which was first found to be
involved in multidrug resistance in lung cancer and, later, found to be expressed
in other types of cancer as well (Cole, S. P. C. et al., Science, 258, 1650-1654
(1992) and Leslie, E. M. et al., Toxicology, 167, 3-23 (2001)).

In recent years, new ABC family molecules have been found in
succession and, besides P-gp and MRP, molecular pumps that are suggested to
be involved in drug resistance are being clarified. One such molecule is
a molecular pump called ABCG2 (BCRP/MXR/ABCP). This molecule has been
named and reported as ABCP, the gene which is
expressed specifically in placenta (Allikmets, R. et al., Cancer Res. 58, 5337-5339
(1998)), as BCRP, the gene obtained from a resistant cell line selected by
adriamycin (Doyle, A. et al., Proc. Natl. Acad. Sci. U.S.A. 95, 15665-15670
(1998)), and as MXR, the gene obtained from a resistant cell line selected by
mitoxantrone (Miyake, K. et al., Cancer Res. 59, 8-13 (1999)). Among these
three genes, differences of 1 to 4 amino acid(s) arising from
nucleotide substitutions between the respective genes were observed.

From the analysis of a cell line produced by introducing and
expressing the nucleotide sequence reported as BCRP in MCF-7 cells,
expression of this gene was shown to give resistance to mitoxantrone and
adriamycin. Thus, the gene has attracted attention as a novel factor of multidrug
resistance (Doyle, A. et al., Proc. Natl. Acad. Sci. U.S.A., 95, 15665-15670
(1998); WO 99/40110).

Under such circumstances, the present applicant found that the
excretion pump of indolocarbazole compounds is an ABCG2 gene of SEQ ID
NO:1 (Komatani, H. et al., Cancer Research, 61, 2827-2832 (2001), WO
02/28894). In the gene reported as BCRP, the 482nd codon encodes threonine,
while the ABCG2 gene of SEQ ID NO:1 was a new nucleotide sequence where
the 482nd codon encodes arginine.

The ABCG2 gene of SEQ ID NO:1 is a gene which confers a selective 
resistance on a cell to a compound of the following general formula (I) 
(hereinafter, referred to as "indolocarbazole compound"): 



wherein X1 and X2 each independently represent a hydrogen atom,
halogen atom or hydroxyl group; R represents a hydrogen atom, amino,
formylamino, or lower alkylamino which may be substituted with any one
selected from the group consisting of one to three hydroxyl group(s), a pyridyl
group optionally having substituent(s), and thienyl group optionally having
substituent(s); and G represents a pentose group or hexose group or derivative
thereof which may be substituted with an amino group, more specifically, to the
compound such as Compound A (wherein X1 is 1-hydroxyl group, X2 is
11-hydroxyl group, R is formylamino and G is β-D-glucopyranosyl group in the
general formula (I)) and to the compound such as Compound B (wherein X1 is
2-hydroxyl group, X2 is 10-hydroxyl group, R is (1-hydroxymethyl-2-hydroxyl)
ethylamino group and G is β-D-glucopyranosyl group in the general formula (I)).

It has been shown by Northern blotting analysis that the ABCG2 gene of
SEQ ID NO:1, for example, is highly expressed in cells which are
resistant to both Compound A and Compound B (Yoshinari, T. et al., Cancer Res.
59, 4271-4275 (1999)) and that the accumulation of indolocarbazole compounds
represented by Compound A, Compound B, etc. in the cells is selectively
suppressed by the gene (Komatani, H. et al., Cancer Res. 61, 2827-2832
(2001); WO 02/28894). Accordingly, analysis of the genetic polymorphisms
affecting the activity or expression of ABCG2, including the ABCG2 gene of
SEQ ID NO:1, is thought to be useful for the selection of anticancer drugs used for
the therapy. However, no such genetic polymorphism has been known so far.

Summary of the Disclosure 

Under such circumstances, there has been a demand for the
development of methods for diagnosing, in each patient, the excreting
capability of a transporter gene product which excretes chemotherapeutic drugs
out of the cells. For example, cancer chemotherapeutic drugs having an anthraquinone
skeleton such as adriamycin, doxorubicin and mitoxantrone are not very effective
against cells in which P-gp, MRP or BCRP is detected to be highly expressed.

Although the indolocarbazole compounds are effective anti-cancer drugs
regardless of the expression of P-gp or MRP, their effect on cancer cells
in which ABCG2 is highly expressed is low.

However, if a genetic polymorphism affecting the activity or the
expression of ABCG2 can be detected in advance, the detection may be useful
for the selection of anti-cancer drugs in cancer therapy and for the selection of
inhibitors of ABCG2 activity in combined cancer therapy.

For example, the ABCG2 gene of SEQ ID NO:1 which is widely found is
a gene giving an indolocarbazole compound-selective resistance on a cell while
the ABCG2-Thr482 gene where the 482nd amino acid is modified to threonine
gives a resistance to mitoxantrone and adriamycin in addition to indolocarbazole
compounds and, therefore, a method for detecting the difference between those
two genes is useful for the selection of anti-cancer drugs in cancer therapy.

In addition, detecting in advance an ABCG2 genetic polymorphism which
lowers the activity of ABCG2 is useful, for example, for finding the optimum
dose of the indolocarbazole compound in cancer therapy.

Accordingly, it is an object of the present invention to provide a
polymorphism of ABCG2 polypeptide related to intracellular accumulation of
indolocarbazole compounds and of a polynucleotide coding therefor. It is also
an object of the present invention to provide a method for detecting the presence
or absence of the polymorphism of ABCG2 polypeptide or polynucleotide coding
therefor in the test sample derived from patients suffering from cancer, by using
a nucleic acid which is specific to polymorphism of ABCG2-related gene or
antibody to ABCG2 polypeptide. It is still another object of the present
invention to provide a method for an effective use of indolocarbazole compounds
by detecting the presence or absence of the polymorphism of ABCG2
polypeptide or polynucleotide coding therefor.

In order to achieve these objects, the present inventors analyzed genomic
DNA extracted from many human cancer cell lines and clinical samples and
identified single nucleotide polymorphisms (SNPs) in the ABCG2 gene. It was
found that those SNPs cause mutations such as an amino acid substitution and
deletion at specific sites of the ABCG2 polypeptide. Then, when cell lines
expressing each of the specific mutant ABCG2 polypeptides were prepared and
their resistance to drugs was tested, it was found that the drug transport capability
of the mutant ABCG2 polypeptides was greatly lowered as compared with that of
wild type ABCG2 polypeptide. On the basis of such findings, the present invention
has been accomplished.

Accordingly, in a first aspect of the present invention, there is provided a
method for predicting a drug transport capability of a mammalian cell comprising
the steps of collecting a sample from a mammal and determining at least a
polymorphism of the nucleic acid sequence of ABCG2 gene or at least a
polymorphism of the amino acid sequence of ABCG2 polypeptide.

In a preferred embodiment of the present invention, the ABCG2 gene
comprises a DNA consisting of the nucleotide sequence of SEQ ID NO:1 and the
polymorphism of the nucleotide sequence is one or more of single nucleotide
polymorphisms at positions selected from the group consisting of 34, 376 and
421 of SEQ ID NO:1. It is further preferred that the single nucleotide
polymorphism is selected from the group consisting of G34A, C376T and C421A.
Here, "G34A" means that the 34th guanine is substituted with adenine, "C376T"
means that the 376th cytosine is substituted with thymine and "C421A" means
that the 421st cytosine is substituted with adenine. Polymorphism of the
nucleotide sequence can be determined by any one of methods selected from
the group consisting of a direct sequencing method, TaqMan method, invader
method, mass spectrometric method, RCA method and DNA chip method.

In another preferred embodiment of the present invention, the ABCG2
polypeptide comprises a polypeptide consisting of an amino acid sequence of
SEQ ID NO:2 and the polymorphism of the amino acid sequence is one or more
of amino acid polymorphisms at positions selected from the group consisting of
12, 126 and 141 of SEQ ID NO:2. It is preferred that the amino acid
polymorphism is an amino acid substitution of Val12Met or Gln141Lys or deletion
of the amino acid sequence downstream from position 126 of SEQ ID NO:2.
The polymorphism of the amino acid sequence can be determined by any of
methods selected from the group consisting of mass spectrometric method,
two-dimensional electrophoresis method and protein chip method.

In a still further preferred embodiment of the present invention, the
aforementioned drug is a compound represented by the following general
formula (I) (hereinafter, referred to as "indolocarbazole compound").

[Structural formula (I)]

[In the formula, X1 and X2 each independently represent a hydrogen
atom, halogen atom or hydroxyl group; R represents a hydrogen atom, amino,
formylamino, or lower alkylamino which may be substituted with any one
selected from the group consisting of one to three hydroxyl group(s), a pyridyl
group optionally having substituent(s), and thienyl group optionally having
substituent(s); and G represents a pentose group or hexose group or derivative
thereof which may be substituted with an amino group].

In the second aspect of the present invention, there is provided a
polynucleotide having a single nucleotide polymorphism(s) at one or more
position(s) selected from the group consisting of 34, 376 and 421 of SEQ ID
NO:1 wherein the polynucleotide comprises any one of the positions of the
single nucleotide polymorphisms and consists of at least 10 contiguous
nucleotides or a complementary polynucleotide thereto. In a preferred
embodiment, the aforementioned single nucleotide polymorphism is selected
from the group consisting of G34A, C376T, C421A and single nucleotide
polymorphisms complementary thereto.

In an embodiment, there is provided a polynucleotide having a
nucleotide polymorphism(s) in the polynucleotide sequence of SEQ ID NO:1,
wherein the polymorphism is one or more of nucleotide polymorphism(s)
selected from the group consisting of nucleotide polymorphisms by which the
translated amino acid at position 12 is methionine, one at position 126 is stop
codon and one at position 141 is lysine, and comprising at least 10 contiguous
nucleotides including one or more of nucleotide(s) located at the site of the
nucleotide polymorphisms, or complementary sequence thereof.
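
As an illustration only, the requirement that a fragment of at least 10
contiguous nucleotides include a polymorphic position can be sketched as
follows. The function name, the default window length and the sequence length
of 2000 are hypothetical placeholders chosen for this sketch; they are not
taken from SEQ ID NO:1.

```python
def windows_covering(position, seq_length, k=10):
    """Return 1-based (start, end) pairs of every k-nucleotide window
    that contains `position` within a sequence of `seq_length` bases."""
    if not 1 <= position <= seq_length:
        raise ValueError("position lies outside the sequence")
    first_start = max(1, position - k + 1)           # window ending at the SNP
    last_start = min(position, seq_length - k + 1)   # window starting at the SNP
    return [(s, s + k - 1) for s in range(first_start, last_start + 1)]


# seq_length=2000 is a placeholder, not the real length of SEQ ID NO:1.
for snp in (34, 376, 421):
    spans = windows_covering(snp, seq_length=2000)
    print(snp, spans[0], spans[-1], len(spans))
```

Longer fragments can be enumerated by raising k; the complementary strand
covers the same coordinates.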

In the third aspect of the present invention, there is provided a pair of
PCR primers which specifically hybridize to the ABCG2 gene and amplify a DNA
fragment of a portion of the gene, wherein the amplified DNA fragment
comprises a nucleotide at position 34, 376 or 421 of SEQ ID NO:1. In a
preferred embodiment, the pair of PCR primers is any of the primer pairs
selected from the group consisting of SEQ ID Nos. 5 and 6, SEQ ID Nos. 9 and
10 and SEQ ID Nos. 11 and 12.

In the fourth aspect of the present invention, there is provided a
polynucleotide which specifically hybridizes to ABCG2 gene and is capable of
detecting the polymorphism of ABCG2 gene at position 34, 376 or 421 of SEQ
ID NO:1. In a preferred embodiment, the aforementioned polynucleotide can
be used in any of the methods selected from the group consisting of a direct
sequencing method, TaqMan method, invader method, mass spectrometric
method, RCA method and DNA chip method.

In the fifth aspect of the present invention, there is provided a mutant
ABCG2 polypeptide having polymorphic mutation(s) relative to either (a) a human
ABCG2 polypeptide consisting of the amino acid sequence of SEQ ID NO:2 or
(b) an isopolypeptide of (a) consisting of an amino acid sequence of SEQ ID
NO:2 wherein one or several amino acid(s) except for the amino acids at
positions 12, 126 and 141 are deleted, substituted or added, and having a drug
transport capability. The mutant ABCG2 polypeptide is a polypeptide
where one or both of the amino acid(s) at positions 12 and 141 of SEQ ID NO:2
are substituted with other amino acid(s), or a polypeptide fragment
comprising the substituted amino acid(s) and at least 10 contiguous amino acid
residues of the mutant ABCG2 polypeptide above, or a polypeptide where
the amino acid residues downstream from position 126 of SEQ ID NO:2 are
deleted.

In the sixth aspect of the present invention, there is provided an antibody
which specifically binds to the mutant ABCG2 polypeptide in the fifth aspect of
the present invention.

In the seventh aspect of the present invention, there is provided a
transformed cell which expresses an ABCG2 polypeptide having one or both of
amino acid substitutions Val12Met and Gln141Lys to the amino acid sequence of
SEQ ID NO:2 defined as either (a) a human ABCG2 polypeptide consisting of an
amino acid sequence of SEQ ID NO:2 or (b) an isopolypeptide of (a) consisting
of an amino acid sequence of SEQ ID NO:2, wherein one or several amino
acid(s) except for the amino acids at positions 12, 126 and 141, are deleted,
substituted or added, and having a drug transport capability.

In the eighth aspect of the present invention, there is provided a method 
for measuring a drug transport capability using the transformed cell in the 
seventh aspect. 

In the ninth aspect of the present invention, there is provided a method
for diagnosing a drug sensitivity comprising the steps of collecting a sample from
a subject and determining the presence or absence of the polynucleotide in the
second aspect or the polypeptide in the fifth aspect. In a preferred embodiment,
it is suggested that the subject having the polynucleotide and/or polypeptide is
sensitive to the indolocarbazole compound.

In the tenth aspect of the present invention, there is provided a kit for
diagnosing a drug sensitivity comprising one or more of the polynucleotide in the
second aspect, the pair of primers in the third aspect, the polynucleotide in the
fourth aspect, the polypeptide in the fifth aspect, the antibody in the sixth aspect
and the transformed cell in the seventh aspect.

In the eleventh aspect of the present invention, there is provided a
computer system for the analysis of ABCG2 polymorphism comprising (a) an
input-output device(s), (b) a memory (storage medium) containing the
polymorphism data and (c) a central processing unit.

Brief Description of the Drawings 

Fig. 1 is a schematic diagram of the ABCG2 polypeptide showing the 
positions of single nucleotide polymorphisms of the present invention. 

Fig. 2 is a result of Northern blot analysis by which the amount of the
ABCG2 mRNA in various transformed cells was determined.

Modes for Carrying Out the Invention 
(Definitions) 

In the present specification, the following terms are defined as follows
unless otherwise mentioned. "ABCG2" is a molecular pump which belongs to
the ABC transporter superfamily; the name denotes both a polypeptide by which
cancer chemotherapeutic drugs are excreted out of a cell and the gene coding
therefor. The gene includes cDNA and genomic gene.

The term "polymorphism" refers to the existence of more than one form
of a gene, polypeptide or portion thereof. A portion of a gene, wherein there are
at least two different forms, i.e., two different nucleotide sequences, is referred to
as a "polymorphic region of a gene". A polymorphic region can be a single
nucleotide pair, or can span a stretch of several nucleotide pairs. The
term "single nucleotide polymorphism (SNP)" refers to the replacement of one
base by another base and, in the human genome, it is presumed that one SNP is
present in every several hundred to one thousand base pairs. Besides that, at
sites where a unit of two bases to several tens of bases is tandemly repeated,
there exist repetitive sequences whose repeat number differs among
individuals; these are called VNTR (variable number of tandem repeats)
and microsatellite polymorphisms. SNPs have different effects
depending upon the position where they are located: some exist in the
region which is translated to polypeptide, resulting in substitution or deletion in
the amino acid sequence and affecting the function of the gene; some others exist in
a region which controls gene expression, such as a promoter or intron,
affecting the expressed amount of the gene; and still others exist in other regions
and have nearly no influence on gene expression.

In the present specification, the term "mammalian cell" means tissues or
cells that constitute the body of an animal belonging to the mammals, or cultures
of such cells maintained outside the body. The term "sample" means a sample
containing polynucleotides derived from living organisms and includes living,
dead or even archaeological samples collected from various tissues and cells.
Specific examples are body fluid (blood, urine, saliva, and the like), skin, root
of hair, mucous membrane, internal organs, placenta and cord blood.

In the present specification, the term "drug" means a xenobiotic having a
physiological activity including a cancer chemotherapeutic drug used for the
purpose of treating cancer. It includes a synthetic compound, natural
compound derived from plants or microorganisms and a semi-synthetic
compound which is synthesized from the natural compound. Preferably, the
"drug" means a compound represented by the following general formula (I)
(hereinafter, referred to as "indolocarbazole compound"):

[Structural formula (I)]

[wherein X1 and X2 each independently represent a hydrogen atom,
halogen atom or hydroxyl group; R represents a hydrogen atom, amino,
formylamino, or lower alkylamino which may be substituted with any one
selected from the group consisting of one to three hydroxyl group(s), a pyridyl
group optionally having substituent(s), and thienyl group optionally having
substituent(s); and G represents a pentose group or hexose group or derivative
thereof which may be substituted with an amino group]. More preferably, it
means a compound of the general formula (I) wherein X1 and X2 each
independently represent a halogen atom or hydroxyl group; R represents a
hydrogen atom, formylamino, or lower alkylamino wherein said lower alkylamino
may be substituted with any one selected from the group consisting of one to
three hydroxyl group(s), a pyridyl group optionally having substituent(s), and a
thienyl group optionally having substituent(s); and G represents a hexose group
which may be substituted with an amino group. The production method and the
like of the aforementioned indolocarbazole compounds have been disclosed in
prior patent applications and registered patents (European patent publication
0528030 A1, U.S. Patent Nos. 5,591,842, 5,668,271, and 5,804,564, WO
95/30682, WO 96/04293, WO 98/07433 and JP Patent Kokai Publication No.
JP-A-10-245390). The production methods of Compound A and Compound B,
in particular, are disclosed in JP Patent Kokai
Publication No. JP-A-6-128283 and WO 95/30682, respectively.

In the present specification, the term "polynucleotide" generally refers to
both polyribonucleotide and polydeoxyribonucleotide, which can be either a
non-modified RNA or DNA or a modified RNA or DNA. The examples
thereof are DNA, cDNA, genomic DNA, mRNA, unprocessed RNA and
fragments thereof. Although there is no particular limitation in its length, it is
usually about 10 bases or longer. On the other hand, the term "oligonucleotide"
refers to those which are relatively shorter than the "polynucleotide" in length,
which is generally about 50 bases or less.

In the present specification, the term "polypeptide" refers to a compound
consisting of two or more amino acids linked by peptide bond(s) and
includes a relatively short-chain polypeptide called a peptide or oligopeptide
and a long-chain polypeptide called a protein. The polypeptide may contain
amino acid(s) other than the 20 genetically coded amino acids.
It may also contain modified amino acid(s). Such modified amino
acid(s) are produced in vivo, for example, by posttranslational processing, or by a
chemical modification which is known among the persons skilled in the art. The
modification can take place at the main chain of the peptide bond, the side chain of
an amino acid, the amino terminal or the carboxyl terminal and includes, for example,
acetylation, acylation, ADP ribosylation, amidation, biotinylation, covalent bond
with lipid or lipid derivative, formation of cross-linking bond, disulfide bond,
addition of sugar chain, addition of GPI anchor, phosphorylation and prenylation.

(Method for Predicting the Drug Transport Capability)

In an embodiment of the present invention, there is provided a method
for predicting a drug transport capability of a mammalian cell, comprising the
steps of collecting a sample from the mammal and determining at least a
polymorphism of the nucleotide sequence of ABCG2 gene or at least a
polymorphism of the amino acid sequence of ABCG2 polypeptide. Here, the
ABCG2 gene comprises a human cDNA having a nucleotide sequence shown in
SEQ ID NO:1 which is a gene giving a resistance on a cell to the indolocarbazole
compound represented by Compound A. It further comprises a human isogene
which hybridizes under a stringent condition to a DNA complementary to the
nucleotide sequence of SEQ ID NO:1, and also encodes a polypeptide having a
drug transport capability, as well as mammalian homologues thereof. The
condition of "to hybridize under a stringent condition" is an experimental
condition for hybridization which has been known among the persons skilled in
the art. Specifically, it means that two nucleic acid fragments hybridize to
each other under a hybridization condition described in "Expression of cloned
genes in E. coli" by J. Sambrook in 9.47 - 9.62 and 11.45 - 11.61 of "Molecular
Cloning: A Laboratory Manual", 2nd edition (1989), Cold Spring Harbor
Laboratory Press, New York, U.S.A.

To be more specific, "under a stringent condition" means that, after
hybridization at about 45°C in 6.0 x SSC, washing is conducted at 50°C in 2.0
x SSC. For the selection of stringency, salt concentration in the washing step
may be selected, for example, from a low stringency of about 2.0 x SSC at 50°C
to a high stringency of about 0.2 x SSC at 50°C. It is also possible that
temperature for the washing step may be increased from a low stringency
condition of room temperature or about 22°C to a high stringency condition of
about 65°C. Incidentally, it is possible for persons skilled in the art to achieve a
hybridization condition of the same stringency as the above condition by an
appropriate selection of various conditions such as diluting ratio of SSC,
concentration of formamide and temperature. Accordingly, the isogene
includes various mutant genes which have been known already. For example,
BCRP gene obtained from adriamycin-resistant cell line, ABCP gene which is
specifically expressed in placenta and MXR gene obtained from resistant cell
line selected by mitoxantrone are different from human ABCG2 gene of SEQ ID
NO:1 in several nucleotide sequences but all of them are isogenes derived from
ABCG2 gene of SEQ ID NO:1 and are included in "ABCG2 gene" of the present
invention.

ABCG2 polypeptide is: (a) a human ABCG2 polypeptide consisting of an
amino acid sequence of SEQ ID NO:2; (b) an isopolypeptide to (a) consisting of
an amino acid sequence of SEQ ID NO:2 wherein one or several amino acids
except for the amino acids at positions 12, 126 and 141 are deleted, substituted
or added and having a drug transport capability; or (c) a mammalian homologue
to (a) or to (b). Here, the human ABCG2 polypeptide consisting of the amino
acid sequence of SEQ ID NO:2 is a polypeptide which gives a selective
resistance on a cell to the indolocarbazole compound represented by Compound
A. The isopolypeptide may have modifications of deletion, substitution or
addition of one or several amino acid(s) in the amino acid sequence of SEQ ID
NO:2 so far as the drug transport capability which is a function of the ABCG2
polypeptide is retained, and the number of modified amino acids in the functionally
identical polypeptide is usually within 10% of total amino acids, preferably
within 10 amino acids and, more preferably, within 3
amino acids (such as one amino acid).

When the ABCG2 genetic polymorphism and ABCG2 polypeptide
polymorphism are present in specific positions and those polymorphisms are
present in more than a certain frequency in a specific population, the genetic
significance of those polymorphisms becomes important. In a preferred
embodiment of the present invention, specific SNPs as shown in Fig. 1 are
disclosed. Fig. 1 shows schematically how the ABCG2 polypeptide is present
in cell membrane along with the SNP sites according to the present invention.
The ABCG2 polypeptide contains a leader sequence at its N-terminal necessary
for localization to the cell membrane, followed by an ATP binding region (amino
acids 61-270) and six transmembrane regions participating in the drug
transportation. Fig. 1 shows four SNP sites which are mutations where the 34th
guanine in SEQ ID NO:1 is substituted with adenine (hereinafter, referred to as
"G34A"), the 376th cytosine therein is substituted with thymine (hereinafter,
referred to as "C376T"), the 421st cytosine therein is substituted with adenine
(hereinafter, referred to as "C421A") and the 458th cytosine therein is substituted
with thymine (hereinafter, referred to as "C458T"). As a result of those SNPs, in
the amino acid sequence of the ABCG2 polypeptide, the 12th valine from the
N-terminal is substituted with methionine (hereinafter, referred to as "Val12Met"),
the 126th glutamine therefrom becomes a termination codon (hereinafter,
referred to as "Gln126Term"), the 141st glutamine therefrom is substituted with
lysine (hereinafter, referred to as "Gln141Lys") and the 153rd threonine
therefrom is substituted with methionine (hereinafter, referred to as "Thr153Met").
The mutation of Val12Met is present in a leader sequence necessary for
localization of the ABCG2 polypeptide to cell membrane and the mutations of
Gln141Lys and Thr153Met are present in an ABC (ATP-binding cassette)
domain which is important for binding to ATP as transportation energy and,
therefore, there is a strong possibility that those mutations affect the drug
transport capability of the ABCG2 polypeptide. It is apparent that the mutation
Gln126Term abolishes the drug transport activity since a complete ABCG2
polypeptide is not synthesized.
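
The correspondence between the nucleotide positions and the amino acid
positions named above can be checked with a short sketch. It assumes that
nucleotide numbering starts at the first base of the initiation codon, an
assumption that is consistent with the position pairs stated in this
specification but is not spelled out in it; the function and variable names
are illustrative only.

```python
def codon_of(nt_position):
    """Map a 1-based coding-sequence position to
    (codon number, position within the codon), both 1-based."""
    if nt_position < 1:
        raise ValueError("positions are 1-based")
    index = nt_position - 1
    return index // 3 + 1, index % 3 + 1


# SNP names and amino acid changes are those given in the text above.
SNPS = {
    "G34A": (34, "Val12Met"),
    "C376T": (376, "Gln126Term"),
    "C421A": (421, "Gln141Lys"),
    "C458T": (458, "Thr153Met"),
}

for name, (position, change) in SNPS.items():
    codon, offset = codon_of(position)
    print(f"{name}: codon {codon}, base {offset} of the codon ({change})")
```

Under this assumption each nucleotide position falls in the codon whose number
matches the amino acid position reported for it.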

Drug transport capability of those mutant ABCG2 polypeptides can be
checked by preparing a transformant which expresses the mutant ABCG2
polypeptide by means of a recombinant DNA technique. As will be illustrated in
detail hereinafter in the present specification, it is noted that, as a result of
measurement of drug sensitivity using those transformed cells, the drug
transport activity of the mutant ABCG2 polypeptides of the aforementioned
Val12Met and Gln141Lys is significantly low as compared with that of the wild
type ABCG2.

Alternatively, it is also possible to test whether those mutations are
related to the drug sensitivity or not by analyzing biological samples obtained
from a group of subjects having a high sensitivity to a specific drug and those
obtained from a normal group and then analyzing the statistical relationship with
the polymorphism of the present invention (a so-called case-control study). The
statistical analysis can be carried out using a program or the like known to
persons skilled in the art.
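
As one illustration of such a statistical analysis, the numbers of carriers and non-carriers of a polymorphism in the high-sensitivity group and in the normal group can be arranged in a 2x2 table and tested for association. The sketch below uses Fisher's exact test and an odds ratio; this choice of test is an assumption made for illustration and is not specified in the present description, and the function names and the counts are hypothetical.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact p-value for the 2x2 table [[a, b], [c, d]].

    Rows are the two groups; columns are carriers and non-carriers.
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x carriers in the first group.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_observed = prob(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    # Sum the probabilities of all tables at least as extreme as the observed one.
    return sum(p for p in (prob(x) for x in range(lowest, highest + 1))
               if p <= p_observed * (1 + 1e-9))


def odds_ratio(a, b, c, d):
    """Odds ratio of carrying the polymorphism in group 1 versus group 2."""
    if b * c == 0:
        return float("inf")
    return (a * d) / (b * c)


# Hypothetical counts, for illustration only (not data from this specification):
# high-sensitivity group: 12 carriers, 8 non-carriers
# normal group:            5 carriers, 25 non-carriers
print(odds_ratio(12, 8, 5, 25), fisher_exact_two_sided(12, 8, 5, 25))
```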

With regard to a method for determining the polymorphism of the
aforementioned nucleic acid sequence, it may be carried out using various
known techniques which will be mentioned below, such as (1) determination of the
nucleotide sequence of a part of an allele containing at least a polymorphic site,
(2) detection by a probe (allele-specific probe) which specifically hybridizes to a
polymorphic site, and (3) measurement of the molecular weight of a gene fragment
containing a polymorphic site. For example, SNP can be directly detected from
genomic DNA by a direct sequencing method. On the other hand, it is also
possible to use the aforementioned means for identification of (1)~(3) after a
specific genomic DNA region is amplified. Various methods for the DNA
amplification are known to those skilled in the art and include, but are not limited
to, cloning of a desired DNA fragment, polymerase chain reaction (PCR), ligase
chain reaction (LCR), strand displacement amplification (SDA; Walker G., et al.
Proc. Natl. Acad. Sci. USA, 89, 392-396 (1992)), transcription-based
amplification (Kwoh, D. et al., Proc. Natl. Acad. Sci. USA, 86, 1173-1177 (1989)),
self-sustained sequence replication (Guatelli, J., et al., Proc. Natl. Acad. Sci.
USA, 87, 1874-1878 (1990)), the Q-β replicase system (Lizardi, P. et al.,
Bio/Technology, 6, 1197-1202 (1988)), nucleic acid sequence-based
amplification (NASBA; Lewis, R., Genetic Engineering News, 12, 1 (1992)), the
repair chain reaction (RCR), LAMP method (WO 00/28082), and the like.

SNP of the amplified product can be determined by various methods,
such as determination of the nucleotide sequence, measurement of molecular
weight by MALDI-TOF mass spectrometry, etc. and analysis of restriction
fragment length polymorphism (RFLP). The single strand conformation
polymorphism (SSCP) detection technique is another such method, in which
separation is performed on an acrylamide gel or the like under non-denaturing
conditions. It can also be carried out by suitable capillary electrophoresis. This
technique makes it possible to discriminate between different DNA fragments by
their conformation (Orita, et al., Proc. Natl. Acad. Sci. USA, 86, (1989); Cotton,
Mutat. Res., 285, 125-144 (1993); Hayashi, Genet. Anal. Tech. Appl., 9, 73-79
(1992)).

The TaqMan (trademark) method is a method for the detection of SNP using
a fluorescent energy transfer phenomenon where hybridization of an
allele-specific oligonucleotide to a template is performed simultaneously with PCR (cf.
Livak, et al., PCR Methods and Application, 4:357-362, 1995 and U. S. Patent
No. 5,528,848). The allele-specific probe which is labeled with a fluorescent
dye and a quencher is hybridized to a target site and PCR is performed using a
primer which is designed to amplify the region including the site, whereupon the
hybridized probe is cleaved by the 5'-nuclease activity of Taq polymerase as the
elongation reaction from the primer proceeds. When the fluorescent dye is
separated from the quencher, fluorescence results and, as the template is
amplified by the PCR, the fluorescence intensity increases
exponentially. When probes which are specific to two kinds of alleles are
labeled with different fluorescent dyes, it is possible to discriminate homozygote
from heterozygote by one assay.
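
The discrimination of homozygote from heterozygote by two differently labeled probes can be sketched as a simple decision rule on the two end-point fluorescence signals. The following Python sketch is a simplified illustration, not the procedure of any particular instrument; the function name and the single fixed threshold are assumptions, and in practice the cut-off would be derived from controls.

```python
def call_genotype(signal_allele1, signal_allele2, threshold):
    """Classify a sample from the fluorescence of two allele-specific probes.

    threshold is a hypothetical cut-off separating signal from background.
    """
    allele1_present = signal_allele1 >= threshold
    allele2_present = signal_allele2 >= threshold
    if allele1_present and allele2_present:
        return "heterozygote"
    if allele1_present:
        return "homozygote (allele 1)"
    if allele2_present:
        return "homozygote (allele 2)"
    return "no call"


# Hypothetical signal values in arbitrary fluorescence units.
print(call_genotype(950.0, 40.0, threshold=200.0))
print(call_genotype(520.0, 480.0, threshold=200.0))
```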

A variety of methods without amplification of DNA have been developed.
For example, Invader method (trademark) is based on a special enzymatic
reaction where two kinds of oligonucleotides (invader probe and allele probe) are
used and a specific structure formed by those probes with a template DNA is
recognized and cleaved and it is described, for example, in U. S. Patent Nos.
5,846,717, 5,614,402, 5,719,028, 5,541,311 and 5,843,669. In this method, the
target nucleotide sequence is recognized by two different probes. The first
probe is usually called an invader probe and is substantially complementary to
the first site of the target nucleotide sequence. The second probe is called
allele probe and its 3'-terminal side is substantially complementary to the second
site of the target nucleotide sequence while its 5'-terminal side contains a
sequence called a tail or flap which is non-complementary to the template to
form a single strand. When those probes hybridize to an adjacent region of the
template, the 3'-terminal of the invader probe invades into an SNP site and this
structure is cleaved by cleavase whereupon a flap is liberated. The liberated
flap can be quantified when it is previously labeled. Preferably, in order to
quantify the liberated flap, the third FRET (fluorescence resonance energy
transfer) probe (including a sequence complementary to the flap and a
self-complementary sequence) labeled with a fluorescent dye and a quencher may
be used. The liberated flap forms a specific structure by binding to the FRET
probe, and the part of fluorescent dye in the FRET probe is cleaved by cleavase
whereupon fluorescence is generated. When two sets of flap-FRET probes are
prepared and labeled with different fluorescent dyes, it is possible to discriminate
each homozygote and heterozygote by one assay.

MALDI-TOF mass spectrometry is a method which can process a large 
number of samples in a short time without fluorescence labeling of the primer. 
A primer adjacent to the SNP site is prepared and one base elongation reaction
from the primer is performed using ddNTP and PCR-amplified sample DNA as a
template. The ddNTP added to the elongation reaction product is discriminated 
by mass spectrometry. 
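
The discrimination of the added ddNTP by mass can be illustrated as follows. This Python sketch assumes approximate mass increments for unmodified dideoxynucleotide terminators and a hypothetical primer mass and tolerance; it only shows the principle of assigning the incorporated base from the observed mass shift, and the function name is not taken from any instrument software.

```python
# Approximate mass added to the primer by each incorporated terminator (Da).
# Nominal values for unmodified terminators, used here as assumptions.
DDNMP_MASS = {"C": 273.2, "T": 288.2, "A": 297.2, "G": 313.2}


def identify_added_base(primer_mass, product_mass, tolerance=2.0):
    """Return the base whose terminator mass best explains the mass shift.

    Returns None when no single terminator matches within the tolerance.
    """
    shift = product_mass - primer_mass
    base, expected = min(DDNMP_MASS.items(), key=lambda kv: abs(kv[1] - shift))
    if abs(expected - shift) > tolerance:
        return None
    return base


# Hypothetical primer of 6000.0 Da; a C-to-A SNP such as C421A would give
# product peaks about 24 Da apart for the two alleles.
print(identify_added_base(6000.0, 6273.5))
print(identify_added_base(6000.0, 6297.0))
```

A heterozygous sample would show both product peaks, so each observed peak would be assigned separately.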

RCA (rolling circle amplification) is a method where a DNA amplifying
means, in which a long complementary-stranded DNA is synthesized as DNA
polymerase moves on a cyclic single-stranded template DNA, is applied to
SNP typing. Recognition of SNP (allele) is carried out by checking whether
amplification takes place by the RCA method. Namely, a single-stranded probe
(padlock probe), which is able to form a ring when annealed with genomic
DNA, is hybridized to genomic DNA to conduct a chain reaction. When the
terminal of the probe is placed at the SNP site, a connected ring is formed
whereupon amplification by RCA takes place if that site is matched while, when
mismatched, no connection takes place, giving no ring, whereupon RCA
amplification does not take place. Discriminating between the two amplification
outcomes makes it possible to determine the SNP (Lizardi, P. M., et al., Nat. Genet.,
19, 225 (1998)).

The DNA chip method, where PCR-amplified fluorescence-labeled cDNA or
cRNA is hybridized to various oligonucleotide probes including a polymorphic
site arranged on a microarray, is useful as a means for quick
detection of many SNPs. There are known a chip in which
oligonucleotides are synthesized on an array by an optical lithographic technique
so that several thousand to several hundred thousand probes are arranged on
a chip (manufactured by Affymetrix; cf. U. S. Patent Nos. 5,424,186, 5,744,101
and 6,040,138), and a method where a previously-prepared cDNA or
oligonucleotide is fixed on glass by means of a pin or ink jet system (cf. U. S.
Patent No. 6,040,138).

With regard to a method for determining the polymorphism of the
aforementioned amino acid sequence, various methods have been known and
examples thereof are a proteome analysis by two-dimensional electrophoresis
or a microfluidics method (Vreeland, Wyatt N and Barron, Annelise E, Current
Opinion in Biotechnology, Vol. 13, pages 87-94 (2002)), peptide mapping and
amino acid sequence analysis using a mass spectrometric device, amino acid
sequence analysis by a protein sequencer and a method where interaction
between polypeptide and ligand is detected using protein chips, etc.

The two-dimensional electrophoresis is usually a method where
isoelectric focusing is conducted in the first dimension while SDS-PAGE (sodium
dodecyl sulfate-polyacrylamide gel electrophoresis) is conducted in the second
dimension, and several thousands of proteins can be separated on one sheet of
gel. In the isoelectric focusing, a carrier ampholyte has conventionally been used
but, in recent years, immobilized pH-gradient gel (IPG) strips have been put on the
market and it is now possible to separate with good reproducibility without
causing a pH drift. In the SDS-PAGE, there are a continuous buffer system
where one type of pH buffer is used and a discontinuous buffer system using
buffers of plural pH values. It is also possible to use a low-BIS concentration
gel electrophoresis, a concentration-gradient gel electrophoresis, a
Tricine-SDS-PAGE, etc. depending upon the type of the protein to be separated. The
separated protein can usually be quantified by staining with a dye such as
Coomassie Blue. In a silver staining method, protein can be detected with a
sensitivity 20- to 100-fold higher than that of the staining with Coomassie Blue.
Alternatively, detection with a high sensitivity on a gel is possible using
commercially available fluorescent dyes such as SYPRO Ruby and SYPRO
Orange (Patton, W. F., Electrophoresis, 21, 1123-1144 (2000)). It is also
possible to specifically detect an ABCG2 polypeptide by a western blotting
method using an antibody to the ABCG2 polypeptide.

Mass spectrometry is a technique by which mass (molecular weight) is
precisely measured and, in recent years, it has become possible to
measure precisely the molecular weights of nucleic acids and proteins because
highly polar (highly hydrophilic) high-molecular substances such as proteins
can now be ionized in practice without decomposition. As one of
such mass spectrometric methods, there has been known MALDI-TOF/MS
(matrix-assisted laser desorption ionization time-of-flight/mass spectrometry).
This is a method where, after mixing of a protein sample with a matrix which
absorbs a laser beam such as sinapinic acid (3,5-dimethoxy-4-hydroxycinnamic
acid) followed by drying, a strong pulse laser is irradiated to conduct ionization of
the protein sample by energy transfer from the matrix and the molecular weight of
the ion is analyzed from the difference in flight time of the molecular ions
after the initial acceleration. In order to fragment the peptide inside
the mass spectrometer and to obtain structural information (such as amino
acid sequence or amino acid composition) from analysis of the mass of the
fragment, a tandem mass spectrometry (MS/MS) where plural mass separating
parts are connected is utilized and, for such a purpose, there are also used
analyzers of a triple quadrupole type or a hybrid type using an electrospray
ionization method and of an ion trap type, etc.

Protein chip method is a technique which involves interaction of a 
sample with proteins, peptides, etc. placed on a substrate comprehensively and 
quickly and, with regard to ligands to be immobilized on the substrate, there 
have been developed peptides, antibodies, expressed proteins, etc. 

(Polynucleotide, Pair of primers and Kit) 

In another embodiment of the present invention, there are provided a 
polynucleotide containing the ABCG2 genetic polymorphism, a pair of primers 
for amplifying the DNA fragment containing the aforementioned polymorphisms, 
a polynucleotide for the detection of the aforementioned polymorphisms and a kit
therefor. 

In an embodiment, the polynucleotide of this embodiment is a
polynucleotide having a single nucleotide polymorphism(s) at one or more
position(s) selected from the group consisting of 34, 376 and 421 of the SEQ ID
NO:1 wherein the polynucleotide comprises any one of the positions of the
aforementioned single nucleotide polymorphisms and consists of at least ten
continuous nucleotides or complementary nucleotide thereto. Accordingly, a
polynucleotide containing none of the single nucleotide polymorphisms at the
aforementioned three positions or, in other words, a polynucleotide where the
nucleotide sequences in the above three places are the same as those in SEQ
ID NO:1 is not included in the present embodiment. Preferably, in those
nucleotide sequences, the 34th guanine is substituted with adenine, the 376th
cytosine is substituted with thymine or the 421st cytosine is substituted with
adenine and, in addition to those, substitution in the nucleotide sequence where
codon of the 12th amino acid from N-terminal of the ABCG2 polypeptide is
varied to methionine, the 141st codon thereof is varied to lysine or the 126th
codon thereof is varied to termination codon is included as well.

Those polynucleotides may be natural or synthetic compounds. For
example, they may be manufactured by replication (duplication) of cDNA or
genomic DNA within host cells using recombinant DNA techniques.
Alternatively, they may be manufactured by synthesis in vitro. With regard to
the synthetic method, it is possible to amplify the DNA by means of PCR or the
like or to synthesize the DNA by chemical synthesis. For the persons skilled in
the art, it is possible to introduce a site-specific mutation(s) into the ABCG2 gene
of SEQ ID NO:1 by a known method to prepare the polynucleotide of the present
embodiment. Examples of the method for introduction of site-specific
mutations which is known to the persons skilled in the art are Kunkel method
(Kunkel, T. A. et al., Methods Enzymol. 154, 367-382 (1987)), double primer
method (Zoller, M. J. and Smith, M., Methods Enzymol., 154, 329-350 (1987)),
cassette mutation method (Wells, et al., Gene, 34, 315-23 (1985)) and
mega-primer method (Sarkar, G. and Sommer, S. S., Biotechniques, 8, 404-407
(1990)).

Those polynucleotides may be used for the detection of the genetic 
polymorphisms concerning the present invention. They may be also used for
the suppression of the gene expression as antisense DNAs. 

In another embodiment, there is provided a pair of PCR primers which
specifically hybridize to the ABCG2 gene and amplify a DNA fragment of a
portion of the gene wherein the amplified DNA fragment comprises a nucleotide
at position 34, 376 or 421 of SEQ ID No. 1. The pair of primers of this
embodiment are designed so as to be substantially complementary to each
chain in specific regions upstream and downstream of the aforementioned
polymorphic sites of ABCG2 gene. Although the primers may hybridize to
regions which are 25-2500 base pairs apart, it is preferred that
the size of the amplified product is 100-500 base pairs so as to make
determination of the nucleotide sequence or analysis of the molecular weight of the
amplified product easy. More preferably, the size of the amplified product is
80-200 base pairs. Although the length of those oligonucleotide primers may be
within a range of 10-30 bases, there may be used an oligonucleotide primer
having preferably 18-25 bases and, more preferably, an oligonucleotide primer
having 20-22 bases as shown in SEQ ID Nos. 5 and 6, SEQ ID Nos. 9 and 10
and SEQ ID Nos. 11 and 12, respectively. Those primers may be labeled for
making the detection of the amplified DNA fragment easy. With regard to the
label, there may be used, for example, radioisotope, enzyme, fluorescent dye,
streptavidin, avidin, magnetic beads, antigen and antibody.
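
The size and length conditions stated above can be expressed as a simple check on a candidate primer pair. The Python sketch below is a simplification made for illustration: it treats the primer coordinates as 1-based positions on a single contiguous template numbered as in SEQ ID NO:1 and ignores introns, and the function name and the example coordinates are hypothetical.

```python
SNP_POSITIONS = (34, 376, 421)  # positions in SEQ ID NO:1 named in the text


def check_primer_pair(fwd_start, fwd_len, rev_end, rev_len):
    """Evaluate a primer pair given 1-based coordinates on the template.

    fwd_start: first base covered by the forward primer
    rev_end:   last base covered by the reverse primer (sense-strand numbering)
    """
    amplicon = rev_end - fwd_start + 1
    # A polymorphic site should lie between the primers, not under them.
    inner_start = fwd_start + fwd_len
    inner_end = rev_end - rev_len
    covered = [p for p in SNP_POSITIONS if inner_start <= p <= inner_end]
    return {
        "amplicon_bp": amplicon,
        "amplicon_preferred": 100 <= amplicon <= 500,
        "primer_lengths_preferred": all(18 <= n <= 25 for n in (fwd_len, rev_len)),
        "snps_covered": covered,
    }


# Hypothetical primer pair flanking positions 376 and 421.
print(check_primer_pair(fwd_start=300, fwd_len=20, rev_end=450, rev_len=22))
```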

In still another embodiment, there is provided a polynucleotide which
specifically hybridizes to the ABCG2 gene and which is capable of detecting a
polymorphism of the ABCG2 gene at positions 34, 376 or 421 of SEQ ID NO:1.
With regard to a method for the detection, there are various methods and, for
example, in conducting a detection by an invader method, there are provided an
invader probe being designed to complementarily bind to the 3'-side of the
template from the SNP site, and an allele probe containing a complementary
sequence to the 5'-side of the template from the SNP site and having a
sequence (flap) which is unrelated to the sequence of the template at the 5'-side
thereof. The 3'-terminal of an invader probe which is a sequence of SNP site
may be any base.

A TaqMan probe is a polynucleotide which contains an SNP site and has
a length of about 20 bases being complementary to a template. Its 5'-terminal
is labeled with a fluorescent dye such as FAM or VIC while 3'-terminal is labeled
with a quencher (optical quenching substance).

In a padlock probe used for the RCA method, each of the two ends
comprises about 20 bases near the SNP on the genome, and the two ends are linked
by a specific sequence called a backbone.






Preferably, the aforementioned detection is carried out by determining
the nucleotide sequence using genomic DNA directly or using amplified DNA
fragments. A sequence primer for the determination of a nucleotide sequence
is designed so as to make it substantially complementary to an appropriate site
upstream or downstream of the SNP site. Although the length of those
sequence primers may be within a range of 10 to 30 bases, those having 18 to
25 bases may be preferably used and, more preferably, primers in an ordinary
direction (sense primer) or primers in an inverted direction (antisense primer) as
shown in SEQ ID Nos. 37, 38, 41, 42, 43 and 44 may be used. Those
oligonucleotides may be chemically synthesized by various methods which are
known among the persons skilled in the art. They may be also labeled for
making the detection easy. With regard to a method of labeling, there may be
used radioisotope, enzyme, fluorescent substance, streptavidin, avidin, biotin,
magnetic fine particles, antigen and antibody, etc.

In another embodiment, there is provided a kit for the prediction and
detection of drug transporting capability of mammals. The kit contains either or
both of a pair of primers for amplifying the DNA fragments containing the ABCG2
polymorphism and a polynucleotide for detecting the polymorphism. It is
possible that the target DNA is firstly amplified from the sample to be tested and
genetic polymorphisms are determined using the amplified DNA. On the other
hand, it is also possible to determine the polymorphism directly from genomic
DNA without amplification reaction of DNA. With regard to such a method,
Invader method may be exemplified. As optionally selected attachments, the
kit may contain a reagent for extraction and for purification of DNA, reagent for
PCR such as 10-fold concentrated buffer, heat-resistant DNA polymerase, four
kinds of nucleotide triphosphates (dNTPs), etc.

(Polypeptide) 

In still another embodiment of the present invention, there is provided a
polypeptide having a polymorphic mutation relevant to the present invention in
the ABCG2 polypeptide or isopolypeptide thereof. The ABCG2 polypeptide is a
polypeptide consisting of an amino acid sequence of SEQ ID NO:2 and the
isopolypeptide thereof is a polypeptide consisting of an amino acid sequence
where one or several amino acid(s) is/are deleted, substituted or added in the
ABCG2 polypeptide and having a drug transport capability. Mutant
polypeptides which have been known already such as BCRP, ABCP and MXR
are also included therein. The polymorphism in this embodiment is substitution
of either or both of the 12th and 141st amino acids in SEQ ID NO:2 with other
amino acid(s) or deletion of an amino acid sequence which is downstream from
the 126th of SEQ ID NO:2. The amino acid substitution may be anything so far
as it is other than an amino acid residue shown in SEQ ID NO:2 but, preferably,
the 12th and 141st amino acids in SEQ ID NO:2 are substituted with methionine
and lysine, respectively.

The present embodiment further includes polypeptide fragment of the
ABCG2 polypeptide or isopolypeptide thereof where either or both of the 12th
and 141st amino acid(s) of SEQ ID No. 2 is/are substituted with other amino
acid(s). The polypeptide fragment comprises at least 10 contiguous amino acid
residues, preferably 20 or more contiguous amino acid residues and, more
preferably, it has the length of 30 or more amino acid residues. Those
polypeptides or fragments thereof are useful for the preparation of an antibody to
the polypeptide having polymorphic mutations.

Such polypeptides having polymorphic mutation(s) can be manufactured
by means of chemical synthesis and they further include natural polypeptides
and those which are prepared as recombinant polypeptides utilizing genetic
recombination techniques. The natural polypeptides may, for example, be an
extracted and purified polypeptide from tissues such as placenta where the
human mutant ABCG2 polypeptide of this embodiment is thought to be
expressed. On the other hand, the recombinant polypeptides can be prepared,
as will be mentioned later, by cultivation of cells transformed by DNA coding for
the human mutant ABCG2 polypeptide of this embodiment.

The expressed or isolated polypeptide or the fragment thereof may be
detected by known methods and it is possible to detect by, for example,
Coomassie Blue staining, silver staining, western blotting method using an
antibody specific to polypeptide having a polymorphic mutation, etc. In addition,
those polypeptides may be purified by the methods which have been known
already. Those methods include precipitation with ammonium sulfate, gel
filtration chromatography, ion-exchange chromatography and affinity or
immunochromatography.

(Antibody) 

In an embodiment of the present invention, an antibody which
specifically binds to the mutant ABCG2 polypeptide having a polymorphism(s) of
the present invention is provided. The antibody of this embodiment can be
prepared according to a method known to the person skilled in the art (refer,
for example, to "Shin Seikagaku Jikken Koza (New Experimental Course of
Biochemistry) 1, Protein I, pages 389-406, Tokyo Kagaku Dojin"). The
preparation of polyclonal antibody is performed, for example, as follows. An
appropriate dose of the mutant ABCG2 protein of the present
invention or the partial peptide thereof is administered to an
immunocompetent animal such as rabbit, guinea pig, mouse and chicken.
The administration may be
accompanied by an adjuvant (FIA or FCA) which promotes the antibody
production. It is generally administered every several weeks. Multiple
immunizations can elevate the antibody titer. After the final immunization,
antiserum is obtained by collecting blood from the immunized animal. The
polyclonal antibody can be prepared from this antiserum by, for example,
fractionation with ammonium sulfate precipitation and/or anion exchange
chromatography and/or by affinity purification using Protein A and/or immobilized
antigen. On the other hand, a monoclonal antibody is prepared, for example,
as follows. An immunocompetent animal is immunized with the mutant ABCG2
polypeptide of the present invention or the partial
peptide thereof as described
above, and after the final immunization, spleen or lymph node is collected from
the immunized animal. A hybridoma cell is prepared by the cell fusion of the
antibody-producing cell which is contained in this spleen or lymph node and a
myeloma cell using polyethylene glycol or the like. The aimed hybridoma is
screened and cultivated and a monoclonal antibody can be prepared from the
culture supernatant. Purification of the monoclonal antibody can be performed,
for example, by fractionation with ammonium sulfate precipitation and/or anion
exchange chromatography and/or by affinity purification using Protein A and/or
immobilized antigen. The antibody thus prepared is used for an affinity
purification of the mutant ABCG2 polypeptide of the present invention and may
be used for detecting the amount of the expression of the mutant ABCG2
polypeptide of the present invention as well. The detection of the expressed
amount of the mutant ABCG2 polypeptide of the present invention in a
mammalian cell by the antibody makes it possible to determine the sensitivity of
the mammalian cell to the compound represented by the formula (I). It is
further possible that the detection of the mutant ABCG2 polypeptide of the
present invention in a cancer cell or cancer patient by this antibody can be used
for pharmacogenomic therapy, which determines the patient's constitution
such as drug sensitivity for administration of the optimum drug for the patient.

(Transformed Cells and Method for Measuring the Drug Transport Capability
Using the Transformed Cells)

The present invention further relates to a transformed cell which
expresses an ABCG2 polypeptide having one or both of amino acid
substitution(s) Val12Met and Gln141Lys of the amino acid sequence shown by
SEQ ID NO:2 in: (a) a human ABCG2 polypeptide consisting of an amino acid
sequence of SEQ ID NO:2 or (b) an isopolypeptide of (a) consisting of an
amino acid sequence of SEQ ID NO:2 wherein one or several amino acid(s)
except for the amino acids at position(s) 12, 126 and 141 is/are deleted,
substituted or added and having a drug transport capability. The term
"transformed cell" refers to a cell where an exogenous DNA is incorporated into
a host cell by a recombinant vector. The host cell may be either a prokaryotic
cell or a eukaryotic cell and includes any cell which can be used for the object
of the present invention, such as bacterium, yeast cell, insect cell or animal cell.
To be more specific, it is possible to introduce the recombinant vector into the
host cell by the following method whereupon the transformant is obtained.
Transformation of Escherichia coli is carried out by the method of Hanahan
(Hanahan, D., J. Mol. Biol. 166, 557-580 (1983)), the electroporation method
(Dower, W. J., et al., Nucl. Acids Res. 16, 6127-6145 (1988)), and the like.
Transformation of yeast is carried out, for example, by spheroplast method
(Beach, D. and Nurse, P., Nature, 290, 140 (1981)), lithium acetate method
(Okazaki, K., et al., Nucleic Acids Res., 18, 6485-6489 (1990)), etc.
Transformation of insect cell may be carried out by a method, for example,
described in Bio/Technology, 6, 47-55 (1980). Introduction of recombinant DNA
into mammalian cells is carried out by a calcium phosphate method (Graham, F.
L. and van der Eb, A. J., Virology, 52, 456-467 (1973)), a DEAE-dextran method
(Sussman, D. J. and Milman, G., Mol. Cell. Biol., 4, 1641-1643 (1984)), a
lipofection method (Felgner, P. L. et al., Proc. Natl. Acad. Sci. USA, 84,
7413-7417 (1987)), an electroporation method (Neumann, E., et al., EMBO J., 1,
841-845 (1982)), etc. The transformed cells prepared as such may, for
example, be used for a method for measuring the drug transporting activity,
analysis of the drug excretion mechanism or screening of a compound which
regulates the drug transporting capability.

(Diagnosing Method and Diagnosing Kit)

In a different embodiment of the present invention, there is provided a
method for diagnosing a drug sensitivity of a subject by the detection of the
polymorphisms of the present invention or a kit therefor. The drug includes
cancer chemotherapeutic drugs and it is clinically useful to diagnose the
sensitivity therefor. For example, when a chemotherapeutic drug is
administered to patients suffering from cancer, the responsiveness differs
greatly from patient to patient, ranging from significantly effective through
weakly effective to entirely ineffective. This may be because, since the
genetic background is different for each patient, the activity of excreting the
chemotherapeutic drug out of the cancer cell is greatly different. Accordingly,
the diagnosis method of this embodiment is quite useful for deciding what type of
chemotherapeutic drug or chemotherapeutic drug group is to be administered
and/or for deciding the effective dose of chemotherapeutic drug or
chemotherapeutic drug group. In a preferred embodiment, the subjects having
polymorphisms shown in Table 3 are suggested to be sensitive to the
indolocarbazole compound represented by the formula (I). Accordingly, a
therapy by an effective dose of the said compound can be applied to the patient
suffering from cancer for which the above compound is effective and a significant
improvement in therapeutic effect as well as a big reduction in side effect can be
expected.

In another embodiment of the present invention, there is provided a kit 
for diagnosing a drug sensitivity containing one or more of the polynucleotide, 
pair of primers, polypeptide, antibody and transformed cell of the present 
invention. The diagnosing kit may contain an appropriate package for a safe 
storage of the constituting reagents and a package insert for illustrating the
method of the present invention. It may further contain an appropriate buffer,
nucleotide, polymerase such as heat-resistant polymerase and fluorescent
substance for the detection.

(Computer System)

In another embodiment of the present invention, there is provided a
computer system where at least one SNP of the ABCG2 gene or at least one
polymorphic mutant polypeptide sequence concerning the ABCG2
polypeptide is stored and displayed. This computer system includes an
input/output device, a central processing unit and a readable storage medium
(memory) where the aforementioned polymorphic sequence data are stored.
The above polymorphic sequence data include nucleotide sequence, genetic
type and haplotype of the ABCG2 gene in a subject population or amino acid
sequence, spots by two-dimensional electrophoresis, mass spectrometry data,
etc. of the ABCG2 polypeptide. These data are processed by various programs
and can be used for determination of genetic type, linkage disequilibrium
analysis, etc. In a preferred embodiment, result of those analyses can be
used for the prediction of drug sensitivity of mammalian cells.
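
As an illustration of how stored haplotype data might be processed for a linkage disequilibrium analysis, the following Python sketch computes allele frequencies and the r-squared measure between two biallelic sites. The data layout (each haplotype as a string with one character per site), the function names and the example haplotypes are all assumptions made for this sketch and are not taken from the present description.

```python
def allele_frequency(haplotypes, site, allele):
    """Fraction of haplotypes carrying the given allele at index `site`."""
    return sum(h[site] == allele for h in haplotypes) / len(haplotypes)


def r_squared(haplotypes, site_a, allele_a, site_b, allele_b):
    """Linkage disequilibrium (r^2) between two biallelic sites.

    haplotypes: phased haplotypes, one string per chromosome.
    """
    p_a = allele_frequency(haplotypes, site_a, allele_a)
    p_b = allele_frequency(haplotypes, site_b, allele_b)
    p_ab = sum(h[site_a] == allele_a and h[site_b] == allele_b
               for h in haplotypes) / len(haplotypes)
    d = p_ab - p_a * p_b  # disequilibrium coefficient D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("both sites must be polymorphic in the sample")
    return d * d / denom


# Hypothetical phased haplotypes; index 0 = position 34, index 1 = position 421.
sample = ["AA", "AA", "GC", "GC"]
print(allele_frequency(sample, 0, "A"), r_squared(sample, 0, "A", 1, "A"))
```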


Examples 

The present invention is explained in more detail by reference to the
following examples, in which single nucleotide polymorphisms of the ABCG2 gene
were identified using human genomic DNAs and cell lines expressing mutant
ABCG2 polypeptides were then prepared to analyze the function thereof.
However, these examples do not restrict the scope of the present invention.

[Example 1] Identification of SNPs in Human ABCG2 Gene 
The present inventors first extracted genomic DNA from 30 human
cancer cell lines and from human clinical samples of 149 persons (whites),
and identified the SNPs by sequencing the ABCG2 gene.

The 30 cancer cell lines are A-427, DLD-1, NCI-H69, HeLa S3, PC-13,
MKN-45, UM-UC-3, HCT116, PA-1, RT4, MKN1, SK-OV-3, MADH, KATOIII,
U118, HS746, T24, MSTO-211H, OVCR3, Lu135, Lx-1, SCC25, Cal27, MKN-74,
SCaBER, BxPC-3, Hela, J82, NCI-H187 and ES-2. Genomic DNA was
extracted from those cell lines with Trizol reagent (Gibco BRL). Human clinical
samples were purchased from IMPATH-BCP Co. The nucleotide sequences of the
sixteen exons and flanking intron regions of the ABCG2 gene were determined by
direct sequencing. First, the sixteen exons were amplified from genomic DNA by
PCR (LA Taq; Takara) using the primer pairs shown in Table 1. Next, the
amplified DNA fragments were treated with ExoSAP-IT (USB Corporation) to
digest remaining primers and to remove unincorporated dNTPs. Then, the DNA
fragments were subjected to a cycle sequencing reaction by the dye terminator method
(Dynamic ET Dye Terminator Cycle Sequencing Kit; Amersham) using the sense
primers shown in Table 2. After removal of the dye terminators on a G-50 gel
filtration column, the nucleotide sequences and SNPs were determined with a
capillary sequencer, MegaBACE1000 (Molecular Dynamics). The identified
SNPs were reconfirmed by sequencing using the antisense primers shown in Table
2.
Table 1

          Forward Primers                       Reverse Primers
Exon 1    5'-GTGCCCACTCAAAAGGTT-3'              5'-TCCAGTCAAAGCTGTACTCTG-3'
Exon 2    5'-ATGTATTGTCACCTAGTGTTTG-3'          5'-AAAGTGTGAAGCCTTGAGCAGA-3'
Exon 3    5'-AACGGAGATGTTTCACAAGA-3'            5'-TACAATAAAGCCCCAAAACA-3'
Exon 4    5'-GAGGAAAAAGAATGGGAGAA-3'            5'-GTCTGCAAAGCCTGCTATAA-3'
Exon 5    5'-TTCCTTCACCTTTCTTTTCC-3'            5'-CTTCCATAAAACTGGTCCCT-3'
Exon 6    5'-GAGGTGCTTTGTATCAGGCT-3'            5'-GATCAGGCCAGTAGGTCAAC-3'
Exon 7    5'-CTTGTAAATACTTGCAGATTACCTG-3'       5'-TGTTCAAGTGACAGAATAAATGGCT-3'
Exon 8    5'-AAAGGGTAAAATTACGTGGG-3'            5'-GCAAACAAACTGACGTTTTC-3'
Exon 9    5'-AATGAAGGTGTTAGGGAAGC-3'            5'-CTGGCTGACACTTCTTTCAC-3'
Exon 10   5'-TCTCCCCAAAGCACAGATAACT-3'          5'-CATTTAAAAATAATTGGGCCAGGTG-3'
Exon 11   5'-CTAATTACCTTCCAAAGGGC-3'            5W\AACCAGGCTGCTCTTTACT-3'
Exon 12   5'-GCTGGGTATTTTTCAAGGAT-3'            5'-AGAGAGTGCAAAATGGACAG-3'
Exon 13   5'-TGCCTGTAGCTCTTCATCTC-3'            5'-ACGAGAGGGAACCAAAATAG-3'
Exon 14   5'-CTTTTTGGCAGCTTTAAATGATAGC-3'       5'-AATCTTTCTCCTTTACTAGGAGGTA-3'
Exon 15   5'-TTTACTTCTTTTGTATTGGAAGCCA-3'       5'-TAGAGGATAAATCGATTGATAGGGA-3'
Exon 16   5'-ATCTGAAGGGGTAATTATTAAAGGC-3'       5'-TGTTCCAGAAATGGTGCAAGAATTC-3'



Table 2

            Sense Primers                         Antisense Primers
Exon 1      5'-GTGCCCACTCAAAAGGTT-3'              5'-CAAGAGTTTTTACCAACCCA-3'
Exon 2      5'-ATGTATTGTCACCTAGTGTTTG-3'          5'-GTGGCCCAATTATTTCACT-3'
Exon 3      5'-TAAGAGTTGGTTTGTGCTTG-3'            5'-AACATGGTCAACTGCTACAT-3'
Exon 4      5'-ATGTTTTGGGGCTTTATTG-3'             5'-TATTCCAGATTCTCCCTGC-3'
Exon 5      5'-CAGGCTTTGCAGACATCTA-3'             5'-ATTGTTATGGAAAGCAACCA-3'
Exon 6      5'-GAGGTGCTTTGTATCAGGCT-3'            5'-CACCCTCATCACAGACATC-3'
Exon 7      5'-CTGTCCTAGAATCTGCATTT-3'            5'-AGCTGGTGCTACAAAAAT-3'
Exon 8      5'-AAAGGGTAAAATTACGTGGG-3'            5'-TCTGGTTGTTGCTTCCTACT-3'
Exon 9      5'-GTTAGGGAAGCATCCAAGA-3'             5'-AGGGAAGCTTTCCAAAAGTA-3'
Exon 10     5'-TCTCCCCAAAGCACAGATAACT-3'          5'-TGGTGGTGGATGTCTGTAGT-3'
Exon 11     5'-CTAATTACCTTCCAAAGGGC-3'            5'-GCTCAGGATTTTCTTCCCTA-3'
Exon 12     5'-CTGGACTGAGTGTTCAGGAG-3'            5'-AGAGAGTGCAAAATGGACAG-3'
Exon 13     5'-TGCCTGTAGCTCTTCATCTC-3'            5'-ATAAGGGCAAAGAGGAAAGT-3'
Exon 14     5'-TTTGTTCTTCCTTTAAAACCG-3'           5'-AATCTTTCTCCTTTACTAGGAGGTA-3'
Exon 15     5'-TTTACTTCTTTTGTATTGGAAGCCA-3'       5'-AAAAGGCCCAAAACAATAAG-3'
Exon 16-1   5'-ATCTGAAGGGGTAATTATTAAAGGC-3'       5'-CAGGAGTTTCCAGAATTCAA-3'
Exon 16-2   5'-TGTTGTTTTCTGTTCCCTTG-3'            5'-TGTTCCAGAAATGGTGCAAGAATTC-3'


The SNPs identified in the 30 human cancer cell lines and in
the human clinical samples of 149 persons (whites) by the
aforementioned nucleotide sequence determination are shown in Table 3.
In the domain column of Table 3, ABC means ATP binding cassette, EC
means extracellular region, TM means transmembrane region and UTR means
untranslated region. Mutation sites are numbered by counting
the first adenine of the translation initiation codon as position 1. For example,
G34A was found in five of the 30 cancer cell lines (16.7%) and in 29 of the
149 human clinical samples (19.5%). Incidentally, the mutation
where the 10th adenine from the 5'-side of intron 3 is substituted with guanine is
shown as "A+10G" and the mutation where the 21st cytosine from the 3'-side of
intron 13 is substituted with thymine is shown as "C-21T". Positions of some
SNPs in Table 3 are shown in Fig. 1 together with a schematic structure of the
ABCG2 polypeptide. Among those SNPs, G34A is present in a leader
sequence which is important for localization of the ABCG2 polypeptide to the cell
membrane, and C421A is a mutation in the ATP binding cassette
(ABC) region, which is important for binding ATP, the energy source for transport.
Accordingly, those mutations have a high possibility of affecting the activity of the
ABCG2 polypeptide. C376T is a mutation to a termination codon in the
ABC region, and it is clear that the resulting ABCG2 loses its activity. Incidentally,
the SNPs C496G, T623C, A1444G and G1445C are reported in NCBI SNP
CLUSTER ID: rs1061017, NCBI SNP CLUSTER ID: rs1061018, Cancer Res.
59, 8-13, 1999 and Proc. Natl. Acad. Sci. USA, 95, 15665-15670, 1998,
respectively, but they were not detected in the aforementioned cell lines and
human-derived samples.
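
The numbering convention above (position 1 is the first adenine of the initiation codon) fixes which codon each coding SNP falls in. The following Python sketch, given only as an illustration, checks the amino acid changes stated for G34A, C369T, C376T and C421A. The function name `annotate` and the dictionary names are hypothetical; the reference codons are copied from SEQ ID NO:1, and the standard genetic code is assumed.

```python
# Standard genetic code, built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# One-letter to three-letter names; "Term" marks a termination codon.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Term",
}

# Reference codons taken from SEQ ID NO:1 (codon number -> codon).
REFERENCE_CODONS = {12: "GTG", 123: "TAC", 126: "CAA", 141: "CAG"}


def annotate(position: int, ref: str, alt: str) -> str:
    """Return the amino acid change for a coding SNP such as G34A."""
    codon_number = (position - 1) // 3 + 1   # 1-based codon index
    offset = (position - 1) % 3              # position within the codon
    codon = REFERENCE_CODONS[codon_number]
    if codon[offset] != ref:
        raise ValueError(f"reference base at {position} is {codon[offset]}, not {ref}")
    mutated = codon[:offset] + alt + codon[offset + 1:]
    before = THREE_LETTER[CODON_TABLE[codon]]
    after = THREE_LETTER[CODON_TABLE[mutated]]
    return f"{before}{codon_number}{after}"
```

For example, `annotate(34, "G", "A")` gives the Val12Met change in the leader sequence, and `annotate(376, "C", "T")` gives the termination codon at position 126.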
Table 3

SNPs     Influence on   Existing    Domain            Frequency in    Frequency in
         Amino Acid     Position                      30 Cell Lines   149 Human
         Substitution                                                 Clinical Samples
G34A     Val12Met       Exon 2      Leader Sequence   5 (16.7%)       29 (19.5%)
A+10G    -              Intron 3    -                 ND              25 (16.8%)
C369T    Tyr123Tyr      Exon 4      ABC               0               1 (0.7%)
C376T    Gln126Term     Exon 4      ABC               1 (3.3%)        0
C421A    Gln141Lys      Exon 5      ABC               6 (20%)         24 (16.1%)
C458T    Thr153Met      Exon 5      ABC               1 (3.3%)        0
C474T    Asp158Asp      Exon 5      ABC               0               1 (0.7%)
C496G    Gln166Glu      Exon 5      ABC               0               0
T623C    Phe208Ser      Exon 6      ABC               0               0
A+20G    -              Intron 11   -                 ND              44 (29.5%)
A1444G   Arg482Gly      Exon 12     TM3               0               0
G1445C   Arg482Thr      Exon 12     TM3               0               0
C-21T    -              Intron 13   -                 ND              36 (24.2%)
A1768T   Asp590Tyr      Exon 15     EC3               0               1 (0.7%)
G2237T   -              Exon 16     3'UTR             1 (3.3%)        0
G2393T   -              Exon 16     3'UTR             1 (3.3%)        0



[Example 2] Preparation of Cell Lines Expressing Mutated ABCG2
Among the polymorphic mutations identified in Example 1, two
mutations, G34A and C421A, which have a high possibility of affecting the function of
the ABCG2 polypeptide, were prepared and introduced into animal cells in order
to analyze their functions. The mutated ABCG2 genes were prepared
by introducing a point mutation by PCR. After confirming
the introduction of the target mutations by sequencing, the mutated genes were
cloned into the HindIII and XhoI sites of the expression vector pcDNA3.1(+), and an
expression plasmid for each mutant was prepared. As controls, a plasmid
expressing the wild type (WT) ABCG2 and the vector plasmid pcDNA3.1(+)
alone without the ABCG2 gene were used, and those four kinds of expression
plasmids were introduced into an animal cell line (porcine kidney cell line LLC-PK1) by
the lipofection method (Lipofectamine; Gibco BRL). Stable transfectants were
selected with 1,500 ng/ml of Geneticin (Gibco BRL) for two weeks and cell lines
were established. To determine the expression level of ABCG2 in each cell line,
total RNA was extracted from each transfectant and from HeLa cells with Trizol
(Gibco BRL). Seven micrograms of the total RNA extracted from each clone
and a full-length ABCG2 cDNA (2.2 kb) probe labeled with 32P were used for
Northern hybridization. Several transfectants which expressed equal amounts of
ABCG2 mRNA were selected to eliminate the effect of expression level, and the
result is shown in Fig. 2. In Fig. 2, each of the transfectant cells
of lanes 2-4 expressed ABCG2 mRNA in an equal amount. Incidentally, GAPDH
was used as an internal standard for the mRNA expressed in each cell.
Lane 5 was a control clone which was transfected with the vector alone
and expressed no ABCG2 mRNA.

[Example 3] Evaluation of Resistance to Compound B 

The transfectant cells which were selected in Example 2 and in which
nearly equal amounts of ABCG2 mRNA were expressed were cultured
in 199 medium containing 1 mM of L-glutamine, 50 units/ml of
penicillin, 50 mg/ml of streptomycin and 10% by volume of fetal bovine serum.
All incubations were carried out at 37°C in a humidified atmosphere
containing 5% carbon dioxide. The cytotoxicity of the anticancer drugs was
determined by the sulforhodamine B dye-staining method and compared among
the clones. Specifically, the four kinds of transformed cell clones were cultured at 37°C
for 72 hours in a medium containing Compound B or camptothecin at various
concentrations, then fixed with trichloroacetic acid and stained for 30 minutes
with 0.4% sulforhodamine B dissolved in 1% acetic acid solution. After
unbound dye was removed by four washes with 1% acetic acid, polypeptide-bound
dye was extracted with 10 mM unbuffered Tris base. Then, the optical
density of the extract was measured in a plate reader at 564 nm and the 50%
inhibitory concentration (IC50) values for cell viability were determined. The
results are shown in Table 4. In the cell line (1-58) expressing the wild type
ABCG2, resistance to Compound B increased 400-fold
or more as compared with the cell line (C4) which was transfected with the vector
alone. In contrast, the resistance to Compound B of the cell line (2-51) having the
mutation Val12Met in the leader sequence of ABCG2 and of the cell line (3-28) having the
mutation Gln141Lys in the ABC region increased only
7.7-fold and 48.2-fold, respectively, as compared with C4; that is, their
resistance was about 1/10 or less of that of the wild type. For
camptothecin, which is not a substrate for ABCG2, there was no significant
difference in resistance among the cells. These results
suggest that the two mutant ABCG2 polypeptides (Val12Met and Gln141Lys)
have a decreased capability of excreting Compound B, which is a
topoisomerase inhibitor, out of the cell as compared with wild type ABCG2.



Table 4 Influence of SNPs on Sensitivity to Compound B

Clone No.                 C4        1-58        2-51              3-28
SNP Site                  -         Wild Type   Leader Sequence   ABC Domain
                                                (Val12Met)        (Gln141Lys)
Expression Level*         0         1.90        2.08              1.95
IC50 (nM Camptothecin)    0.0087    0.0213      0.0122            0.0268
IC50 (nM Compound B)      0.122     >50         0.94              5.88
Increased Rate            1.0       >409        7.7               48.2

*: Expression level of ABCG2 of each cell line was standardized to HeLa cell (= 1.0).
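
The "Increased Rate" row of Table 4 appears to be the IC50 for Compound B of each clone divided by that of the vector-only clone C4. The short Python sketch below, offered as an illustration under that assumption, recomputes the row from the tabulated IC50 values. The names `fold_resistance`, `IC50_COMPOUND_B` and `WILD_TYPE_LOWER_BOUND` are hypothetical, and the wild type entry is only a lower bound because its IC50 is reported as ">50".

```python
import math

# IC50 values for Compound B (nM) as listed in Table 4.
IC50_COMPOUND_B = {"C4": 0.122, "2-51": 0.94, "3-28": 5.88}

# Clone 1-58 (wild type) is reported only as ">50", so 50 is a lower bound.
WILD_TYPE_LOWER_BOUND = 50.0


def fold_resistance(ic50: float, control_ic50: float) -> float:
    """Ratio of a clone's IC50 to the vector-only control, to one decimal."""
    return round(ic50 / control_ic50, 1)


control = IC50_COMPOUND_B["C4"]
val12met = fold_resistance(IC50_COMPOUND_B["2-51"], control)
gln141lys = fold_resistance(IC50_COMPOUND_B["3-28"], control)

# For the wild type only a minimum fold increase can be stated.
wild_type_minimum = math.floor(WILD_TYPE_LOWER_BOUND / control)
```

The computed ratios agree with the tabulated values for clones 2-51 and 3-28, and the wild type lower bound agrees with the ">409" entry.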

Industrial Applicability 

By using the method of the present invention, the drug transport capability
of a mammalian cell can be predicted, whereby the sensitivity of a patient to various
drugs such as anti-cancer drugs can be diagnosed and an indicator for
therapy can be obtained. In other words, when selecting an anti-cancer
drug for cancer therapy, by detecting in particular cancer cells which are highly
sensitive to indolocarbazole compounds, it is now possible to selectively apply
the said compounds for the therapy. In addition, the optimum dose of the
indolocarbazole compounds in cancer therapy can be found and, at the same time,
side effects of the compounds can be reduced, whereby a highly effective method of
using the indolocarbazole compounds is provided.
CLAIMS

1. A method for predicting a drug transport capability of a mammalian cell
comprising the steps of:

collecting a sample from a mammal,

determining a polymorphism of the nucleotide sequence of ABCG2 gene
or a polymorphism of the amino acid sequence of ABCG2 polypeptide.

2. The method of claim 1, wherein said ABCG2 gene comprises a DNA
consisting of the nucleotide sequence of SEQ ID NO:1, and said polymorphism
of the nucleotide sequence is one or more of single nucleotide polymorphisms at
positions selected from the group consisting of 34, 376 and 421 of SEQ ID NO:1.

3. The method of claim 2, wherein said single nucleotide polymorphism is
selected from the group consisting of G34A, C376T and C421A.

4. The method of claim 2, wherein said polymorphism of the nucleotide
sequence is determined by any one of methods selected from the group
consisting of a direct sequencing method, TaqMan method, invader method,
mass spectrometry, RCA method and DNA chip method.

5. The method of claim 1, wherein said ABCG2 polypeptide comprises an
amino acid sequence of SEQ ID NO:2, and said polymorphism of the amino acid
sequence is one or more of amino acid polymorphisms at positions selected
from the group consisting of 12, 126, and 141 of SEQ ID NO:2.

6. The method of claim 5, wherein said amino acid polymorphism is an
amino acid substitution of Val12Met or Gln141Lys, or a deletion of the amino
acid sequence downstream from the position 126 of SEQ ID NO:2.

7. The method of claim 5, wherein said polymorphism of the amino acid
sequence is determined by any one of methods selected from the group
consisting of mass spectrometry, two-dimensional electrophoresis method, and
protein chip method.

8. The method of any one of claims 1 to 7, wherein said drug is a
compound represented by the following general formula (I):

[Chemical structure of general formula (I)]

wherein X1 and X2 each independently represent a hydrogen atom,
halogen atom or hydroxyl group,

R represents a hydrogen atom, amino, formylamino, or lower alkylamino
which may be substituted with any one selected from the group consisting of one
to three hydroxyl group(s), a pyridyl group optionally having substituent(s), and
thienyl group optionally having substituent(s), and

G represents a pentose group or hexose group or derivative thereof
which may be substituted with an amino group.

9. A polynucleotide having a single nucleotide polymorphism(s) at one or
more position(s) selected from the group consisting of 34, 376 and 421 of SEQ
ID NO:1, said polynucleotide comprising any one of the positions of said single
nucleotide polymorphisms and consisting of at least 10 contiguous nucleotides
of SEQ ID NO:1, or a complementary polynucleotide thereto.

10. The polynucleotide of claim 9, wherein said single nucleotide
polymorphism is selected from the group consisting of G34A, C376T, C421A and
single nucleotide polymorphisms complementary thereto.

11. A polynucleotide having one or more of the nucleotide polymorphisms in
the polynucleotide sequence of SEQ ID NO:1, said polymorphism selected from
the group consisting of nucleotide polymorphisms by which the translated amino
acid at position 12 is methionine, one at position 126 is stop codon, and one at
position 141 is lysine, and consisting of at least 10 contiguous nucleotides
including one or more of nucleotides located at the site of said nucleotide
polymorphisms, or a complementary polynucleotide thereto.

12. A pair of PCR primers which specifically hybridize to ABCG2 gene, and
amplify a DNA fragment of a portion of said gene, wherein the amplified DNA
fragment comprises a nucleotide(s) at position 34, 376 or 421 of SEQ ID NO:1.

13. The pair of PCR primers of claim 12, said pair of PCR primers selected
from the group of:

SEQ ID NO:5 and SEQ ID NO:6; SEQ ID NO:9 and SEQ ID NO:10; and
SEQ ID NO:11 and SEQ ID NO:12.

14. A polynucleotide which specifically hybridizes to ABCG2 gene, and
which is capable of detecting a polymorphism(s) of ABCG2 gene at position 34,
376 or 421 of SEQ ID NO:1.

15. The polynucleotide of claim 14, which is capable of using in any one of
methods selected from the group consisting of a direct sequencing method,
TaqMan method, invader method, mass spectrometry, RCA method and DNA
chip method.

16. A polypeptide having polymorphic mutation(s) to ABCG2 protein defined
in the following (a) or (b), said polypeptide is a polymorphic mutant wherein one
or both of amino acids at positions 12 and 141 of SEQ ID NO:2 are substituted
with other amino acid(s), a polypeptide fragment comprising said substituted
amino acid and at least 10 contiguous amino acid residues of said polymorphic
mutant, or a polypeptide wherein the amino acid sequence downstream from the
position 126 of SEQ ID NO:2 is deleted:

(a) a human ABCG2 polypeptide consisting of an amino acid sequence
of SEQ ID NO:2,

(b) an isopolypeptide of (a) consisting of an amino acid sequence of
SEQ ID NO:2, wherein one or several amino acids except for the amino acids at
positions 12, 126 and 141, are deleted, substituted or added, and having a drug
transport capability.

17. An antibody which specifically binds to the mutant ABCG2 polypeptide
of claim 16.

18. A transformed cell which expresses a polypeptide having polymorphic
mutation(s) to ABCG2 protein defined in the following (a) or (b), said
polymorphic mutation(s) being one or both of amino acid substitutions Val12Met
and Gln141Lys of the amino acid sequence of SEQ ID NO:2:

(a) a human ABCG2 polypeptide consisting of an amino acid sequence
of SEQ ID NO:2,

(b) an isopolypeptide of (a) consisting of an amino acid sequence of
SEQ ID NO:2, wherein one or several amino acids except for the amino acids at
positions 12, 126 and 141, are deleted, substituted or added, and having a drug
transport capability.

19. A method for measuring a drug transport capability using the
transformed cell of claim 18.

20. A method for diagnosing a drug sensitivity comprising the steps of:
analyzing a biological sample from a subject, and determining the presence or
absence of a polynucleotide of any one of claims 9 to 11, or a polypeptide of
claim 16.

21. The method of claim 20, wherein the subject having said polynucleotide
and/or said polypeptide is suggested to be sensitive to the compound
represented by the following general formula (I):

[Chemical structure of general formula (I)]

wherein X1 and X2 each independently represent a hydrogen atom,
halogen atom or hydroxyl group,

R represents a hydrogen atom, amino, formylamino, or lower alkylamino
which may be substituted with any one selected from the group consisting of one
to three hydroxyl group(s), a pyridyl group optionally having substituent(s), and
thienyl group optionally having substituent(s), and

G represents a pentose group or hexose group or derivative thereof
which may be substituted with an amino group.

22. A kit for diagnosing a drug sensitivity comprising one or more of the
following (a) to (f):

(a) the polynucleotide of any one of claims 9 to 11,

(b) the pair of primers of claim 12 or 13,

(c) the polynucleotide of claim 14 or 15,

(d) the polypeptide of claim 16,

(e) the antibody of claim 17, and

(f) the transformed cell of claim 18.

23. A computer system for analyzing data of ABCG2 polymorphism, 
comprising: 

(a) an input/output device, 

(b) a memory containing the polymorphism data, and 

(c) a central processing unit. 
SEQUENCE LISTING
<110> BANYU PHARMACEUTICAL CO., LTD

<120> Method for predicting a drug transport capability by ABCG2 polymorphisms

<130> P2725PCT-GN

<150> JP 2002-175806
<151> 2002-06-17

<160> 68

<170> PatentIn version 3.1

<210> 1

<211> 1968

<212> DNA

<213> Homo sapiens

<220>

<221> CDS

<222> (1)..(1965)

<223>

<400> 1

atg tct tcc agt aat gtc gaa gtt ttt atc cca gtg tca caa gga aac 48
Met Ser Ser Ser Asn Val Glu Val Phe Ile Pro Val Ser Gln Gly Asn
1 5 10 15

acc aat ggc ttc ccc gcg aca gct tcc aat gac ctg aag gca ttt act 96
Thr Asn Gly Phe Pro Ala Thr Ala Ser Asn Asp Leu Lys Ala Phe Thr
20 25 30

gaa gga gct gtg tta agt ttt cat aac atc tgc tat cga gta aaa ctg 144
Glu Gly Ala Val Leu Ser Phe His Asn Ile Cys Tyr Arg Val Lys Leu
35 40 45

aag agt ggc ttt cta cct tgt cga aaa cca gtt gag aaa gaa ata tta 192
Lys Ser Gly Phe Leu Pro Cys Arg Lys Pro Val Glu Lys Glu Ile Leu
50 55 60

tcg aat atc aat ggg atc atg aaa cct ggt ctc aac gcc atc ctg gga 240
Ser Asn Ile Asn Gly Ile Met Lys Pro Gly Leu Asn Ala Ile Leu Gly
65 70 75 80

ccc aca ggt gga ggc aaa tct tcg tta tta gat gtc tta gct gca agg 288
Pro Thr Gly Gly Gly Lys Ser Ser Leu Leu Asp Val Leu Ala Ala Arg
85 90 95

aaa gat cca agt gga tta tct gga gat gtt ctg ata aat gga gca ccg 336
Lys Asp Pro Ser Gly Leu Ser Gly Asp Val Leu Ile Asn Gly Ala Pro
100 105 110

cga cct gcc aat ttc aaa tgt aat tca ggt tac gtg gta caa gat gat 384
Arg Pro Ala Asn Phe Lys Cys Asn Ser Gly Tyr Val Val Gln Asp Asp
115 120 125

gtt gtg atg ggc act ctg acg gtg aga gaa aac tta cag ttc tca gca 432
Val Val Met Gly Thr Leu Thr Val Arg Glu Asn Leu Gln Phe Ser Ala
130 135 140

gct ctt cgg ctt gca aca act atg acg aat cat gaa aaa aac gaa cgg 480
Ala Leu Arg Leu Ala Thr Thr Met Thr Asn His Glu Lys Asn Glu Arg
145 150 155 160

att aac agg gtc att caa gag tta ggt ctg gat aaa gtg gca gac tcc 528
Ile Asn Arg Val Ile Gln Glu Leu Gly Leu Asp Lys Val Ala Asp Ser
165 170 175

aag gtt gga act cag ttt atc cgt ggt gtg tct gga gga gaa aga aaa 576
Lys Val Gly Thr Gln Phe Ile Arg Gly Val Ser Gly Gly Glu Arg Lys
180 185 190

agg act agt ata gga atg gag ctt atc act gat cct tcc atc ttg ttc 624
Arg Thr Ser Ile Gly Met Glu Leu Ile Thr Asp Pro Ser Ile Leu Phe
195 200 205

ttg gat gag cct aca act ggc tta gac tca agc aca gca aat gct gtc 672
Leu Asp Glu Pro Thr Thr Gly Leu Asp Ser Ser Thr Ala Asn Ala Val
210 215 220

ctt ttg ctc ctg aaa agg atg tct aag cag gga cga aca atc atc ttc 720
Leu Leu Leu Leu Lys Arg Met Ser Lys Gln Gly Arg Thr Ile Ile Phe
225 230 235 240

tcc att cat cag cct cga tat tcc atc ttc aag ttg ttt gat agc ctc 768
Ser Ile His Gln Pro Arg Tyr Ser Ile Phe Lys Leu Phe Asp Ser Leu
245 250 255

acc tta ttg gcc tca gga aga ctt atg ttc cac ggg cct gct cag gag 816
Thr Leu Leu Ala Ser Gly Arg Leu Met Phe His Gly Pro Ala Gln Glu
260 265 270

gcc ttg gga tac ttt gaa tca gct ggt tat cac tgt gag gcc tat aat 864
Ala Leu Gly Tyr Phe Glu Ser Ala Gly Tyr His Cys Glu Ala Tyr Asn
275 280 285

aac cct gca gac ttc ttc ttg gac atc att aat gga gat tcc act gct 912
Asn Pro Ala Asp Phe Phe Leu Asp Ile Ile Asn Gly Asp Ser Thr Ala
290 295 300

gtg gca tta aac aga gaa gaa gac ttt aaa gcc aca gag atc ata gag 960
Val Ala Leu Asn Arg Glu Glu Asp Phe Lys Ala Thr Glu Ile Ile Glu
305 310 315 320

cct tcc aag cag gat aag cca ctc ata gaa aaa tta gcg gag att tat 1008
Pro Ser Lys Gln Asp Lys Pro Leu Ile Glu Lys Leu Ala Glu Ile Tyr
325 330 335

gtc aac tcc tcc ttc tac aaa gag aca aaa gct gaa tta cat caa ctt 1056
Val Asn Ser Ser Phe Tyr Lys Glu Thr Lys Ala Glu Leu His Gln Leu
340 345 350

tcc ggg ggt gag aag aag aag aag atc aca gtc ttc aag gag atc agc 1104
Ser Gly Gly Glu Lys Lys Lys Lys Ile Thr Val Phe Lys Glu Ile Ser
355 360 365

tac acc acc tcc ttc tgt cat caa ctc aga tgg gtt tcc aag cgt tca 1152
Tyr Thr Thr Ser Phe Cys His Gln Leu Arg Trp Val Ser Lys Arg Ser
370 375 380

ttc aaa aac ttg ctg ggt aat ccc cag gcc tct ata gct cag atc att 1200
Phe Lys Asn Leu Leu Gly Asn Pro Gln Ala Ser Ile Ala Gln Ile Ile
385 390 395 400

gtc aca gtc gta ctg gga ctg gtt ata ggt gcc att tac ttt ggg cta 1248
Val Thr Val Val Leu Gly Leu Val Ile Gly Ala Ile Tyr Phe Gly Leu
405 410 415

aaa aat gat tct act gga atc cag aac aga gct ggg gtt ctc ttc ttc 1296
Lys Asn Asp Ser Thr Gly Ile Gln Asn Arg Ala Gly Val Leu Phe Phe
420 425 430

ctg acg acc aac cag tgt ttc agc agt gtt tca gcc gtg gaa ctc ttt 1344
Leu Thr Thr Asn Gln Cys Phe Ser Ser Val Ser Ala Val Glu Leu Phe
435 440 445

gtg gta gag aag aag ctc ttc ata cat gaa tac atc agc gga tac tac 1392
Val Val Glu Lys Lys Leu Phe Ile His Glu Tyr Ile Ser Gly Tyr Tyr
450 455 460

aga gtg tca tct tat ttc ctt gga aaa ctg tta tct gat tta tta ccc 1440
Arg Val Ser Ser Tyr Phe Leu Gly Lys Leu Leu Ser Asp Leu Leu Pro
465 470 475 480

atg agg atg tta cca agt att ata ttt acc tgt ata gtg tac ttc atg 1488
Met Arg Met Leu Pro Ser Ile Ile Phe Thr Cys Ile Val Tyr Phe Met
485 490 495

tta gga ttg aag cca aag gca gat gcc ttc ttc gtt atg atg ttt acc 1536
Leu Gly Leu Lys Pro Lys Ala Asp Ala Phe Phe Val Met Met Phe Thr
500 505 510

ctt atg atg gtg gct tat tca gcc agt tcc atg gca ctg gcc ata gca 1584
Leu Met Met Val Ala Tyr Ser Ala Ser Ser Met Ala Leu Ala Ile Ala
515 520 525

gca ggt cag agt gtg gtt tct gta gca aca ctt ctc atg acc atc tgt 1632
Ala Gly Gln Ser Val Val Ser Val Ala Thr Leu Leu Met Thr Ile Cys
530 535 540

ttt gtg ttt atg atg att ttt tca ggt ctg ttg gtc aat ctc aca acc 1680
Phe Val Phe Met Met Ile Phe Ser Gly Leu Leu Val Asn Leu Thr Thr
545 550 555 560

att gca tct tgg ctg tca tgg ctt cag tac ttc agc att cca cga tat 1728
Ile Ala Ser Trp Leu Ser Trp Leu Gln Tyr Phe Ser Ile Pro Arg Tyr
565 570 575

gga ttt acg gct ttg cag cat aat gaa ttt ttg gga caa aac ttc tgc 1776
Gly Phe Thr Ala Leu Gln His Asn Glu Phe Leu Gly Gln Asn Phe Cys
580 585 590

cca gga ctc aat gca aca gga aac aat cct tgt aac tat gca aca tgt 1824
Pro Gly Leu Asn Ala Thr Gly Asn Asn Pro Cys Asn Tyr Ala Thr Cys
595 600 605

act ggc gaa gaa tat ttg gta aag cag ggc atc gat ctc tca ccc tgg 1872
Thr Gly Glu Glu Tyr Leu Val Lys Gln Gly Ile Asp Leu Ser Pro Trp
610 615 620

ggc ttg tgg aag aat cac gtg gcc ttg gct tgt atg att gtt att ttc 1920
Gly Leu Trp Lys Asn His Val Ala Leu Ala Cys Met Ile Val Ile Phe
625 630 635 640

ctc aca att gcc tac ctg aaa ttg tta ttt ctt aaa aaa tat tct taa 1968
Leu Thr Ile Ala Tyr Leu Lys Leu Leu Phe Leu Lys Lys Tyr Ser
645 650 655



<210> 2 

<211> 655 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Met Ser Ser Ser Asn Val Glu Val Phe Ile Pro Val Ser Gln Gly Asn
1 5 10 15

Thr Asn Gly Phe Pro Ala Thr Ala Ser Asn Asp Leu Lys Ala Phe Thr
20 25 30

Glu Gly Ala Val Leu Ser Phe His Asn Ile Cys Tyr Arg Val Lys Leu
35 40 45

Lys Ser Gly Phe Leu Pro Cys Arg Lys Pro Val Glu Lys Glu Ile Leu
50 55 60

Ser Asn Ile Asn Gly Ile Met Lys Pro Gly Leu Asn Ala Ile Leu Gly
65 70 75 80

Pro Thr Gly Gly Gly Lys Ser Ser Leu Leu Asp Val Leu Ala Ala Arg
85 90 95

Lys Asp Pro Ser Gly Leu Ser Gly Asp Val Leu Ile Asn Gly Ala Pro
100 105 110

Arg Pro Ala Asn Phe Lys Cys Asn Ser Gly Tyr Val Val Gln Asp Asp
115 120 125

Val Val Met Gly Thr Leu Thr Val Arg Glu Asn Leu Gln Phe Ser Ala
130 135 140

Ala Leu Arg Leu Ala Thr Thr Met Thr Asn His Glu Lys Asn Glu Arg
145 150 155 160

Ile Asn Arg Val Ile Gln Glu Leu Gly Leu Asp Lys Val Ala Asp Ser
165 170 175

Lys Val Gly Thr Gln Phe Ile Arg Gly Val Ser Gly Gly Glu Arg Lys
180 185 190

Arg Thr Ser Ile Gly Met Glu Leu Ile Thr Asp Pro Ser Ile Leu Phe
195 200 205

Leu Asp Glu Pro Thr Thr Gly Leu Asp Ser Ser Thr Ala Asn Ala Val
210 215 220

Leu Leu Leu Leu Lys Arg Met Ser Lys Gln Gly Arg Thr Ile Ile Phe
225 230 235 240

Ser Ile His Gln Pro Arg Tyr Ser Ile Phe Lys Leu Phe Asp Ser Leu
245 250 255

Thr Leu Leu Ala Ser Gly Arg Leu Met Phe His Gly Pro Ala Gln Glu
260 265 270

Ala Leu Gly Tyr Phe Glu Ser Ala Gly Tyr His Cys Glu Ala Tyr Asn
275 280 285

Asn Pro Ala Asp Phe Phe Leu Asp Ile Ile Asn Gly Asp Ser Thr Ala
290 295 300

Val Ala Leu Asn Arg Glu Glu Asp Phe Lys Ala Thr Glu Ile Ile Glu
305 310 315 320

Pro Ser Lys Gln Asp Lys Pro Leu Ile Glu Lys Leu Ala Glu Ile Tyr
325 330 335

Val Asn Ser Ser Phe Tyr Lys Glu Thr Lys Ala Glu Leu His Gln Leu
340 345 350

Ser Gly Gly Glu Lys Lys Lys Lys Ile Thr Val Phe Lys Glu Ile Ser
355 360 365

Tyr Thr Thr Ser Phe Cys His Gln Leu Arg Trp Val Ser Lys Arg Ser
370 375 380

Phe Lys Asn Leu Leu Gly Asn Pro Gln Ala Ser Ile Ala Gln Ile Ile
385 390 395 400

Val Thr Val Val Leu Gly Leu Val Ile Gly Ala Ile Tyr Phe Gly Leu
405 410 415

Lys Asn Asp Ser Thr Gly Ile Gln Asn Arg Ala Gly Val Leu Phe Phe
420 425 430

Leu Thr Thr Asn Gln Cys Phe Ser Ser Val Ser Ala Val Glu Leu Phe
435 440 445

Val Val Glu Lys Lys Leu Phe Ile His Glu Tyr Ile Ser Gly Tyr Tyr
450 455 460

Arg Val Ser Ser Tyr Phe Leu Gly Lys Leu Leu Ser Asp Leu Leu Pro
465 470 475 480

Met Arg Met Leu Pro Ser Ile Ile Phe Thr Cys Ile Val Tyr Phe Met
485 490 495

Leu Gly Leu Lys Pro Lys Ala Asp Ala Phe Phe Val Met Met Phe Thr
500 505 510

Leu Met Met Val Ala Tyr Ser Ala Ser Ser Met Ala Leu Ala Ile Ala
515 520 525

Ala Gly Gln Ser Val Val Ser Val Ala Thr Leu Leu Met Thr Ile Cys
530 535 540

Phe Val Phe Met Met Ile Phe Ser Gly Leu Leu Val Asn Leu Thr Thr
545 550 555 560

Ile Ala Ser Trp Leu Ser Trp Leu Gln Tyr Phe Ser Ile Pro Arg Tyr
565 570 575

Gly Phe Thr Ala Leu Gln His Asn Glu Phe Leu Gly Gln Asn Phe Cys
580 585 590

Pro Gly Leu Asn Ala Thr Gly Asn Asn Pro Cys Asn Tyr Ala Thr Cys
595 600 605

Thr Gly Glu Glu Tyr Leu Val Lys Gln Gly Ile Asp Leu Ser Pro Trp
610 615 620

Gly Leu Trp Lys Asn His Val Ala Leu Ala Cys Met Ile Val Ile Phe
625 630 635 640

Leu Thr Ile Ala Tyr Leu Lys Leu Leu Phe Leu Lys Lys Tyr Ser
645 650 655



<210> 3 
<211> 18 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Exon 1 forward primer 
<400> 3 

gtgcccactc aaaaggtt 18 



<210> 4 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 1 reverse primer 

<400> 4 

tccagtcaaa gctgtactct g 21 



<210> 5 

<211> 22 

<212> DNA 

<213> Artificial Sequence
<220> 

<223> Exon 2 forward primer 

<400> 5 

atgtattgtc acctagtgtt tg 22 



<210> 6 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 2 reverse primer 

<400> 6 

aaagtgtgaa gccttgagca ga 22 



<210> 7 

<211> 20

<212> DNA

<213> Artificial Sequence
<220>

<223> Exon 3 forward primer

<400> 7

aacggagatg tttcacaaga 20



<210> 8 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 3 reverse primer 

<400> 8 

tacaataaag ccccaaaaca 20 



<210> 9 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 4 forward primer 

<400> 9 

gaggaaaaag aatgggagaa 20 



<210> 10 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 4 reverse primer 

<400> 10 

gtctgcaaag cctgctataa 20 



<210> 11 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 5 forward primer 

<400> 11 

ttccttcacc tttcttttcc 20 



<210> 12 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 5 reverse primer 

<400> 12 

cttccataaa actggtccct 20 



<210> 13 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 6 forward primer 

<400> 13 

gaggtgcttt gtatcaggct 20 



<210> 14 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 6 reverse primer 

<400> 14 

gatcaggcca gtaggtcaac 20 



<210> 15 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 7 forward primer 

<400> 15 

cttgtaaata cttgcagatt acctg 25 



<210> 16 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 7 reverse primer 
<400> 16 

tgttcaagtg acagaataaa tggct 25 



<210> 17 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 8 forward primer 

<400> 17 

aaagggtaaa attacgtggg 20 



<210> 18 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 8 reverse primer 

<400> 18 

gcaaacaaac tgacgttttc 20 



<210> 19 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 9 forward primer 

<400> 19 

aatgaaggtg ttagggaagc 20 



<210> 20 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 9 reverse primer 

<400> 20 

ctggctgaca cttctttcac 20 



<210> 21 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 10 forward primer 

<400> 21 

tctccccaaa gcacagataa ct 22 



<210> 22 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 10 reverse primer 

<400> 22 

catttaaaaa taattgggcc aggtg 25 



<210> 23 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 11 forward primer 

<400> 23 

ctaattacct tccaaagggc 20 



<210> 24 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 11 reverse primer 

<400> 24 

aaaccaggct gctctttact 20 



<210> 25 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 12 forward primer 

<400> 25 

gctgggtatt tttcaaggat 20 



<210> 26 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 12 reverse primer 

<400> 26 

agagagtgca aaatggacag 20 



<210> 27 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 13 forward primer 

<400> 27 

tgcctgtagc tcttcatctc 20 



<210> 28 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 13 reverse primer 

<400> 28 

acgagaggga accaaaatag 20 



<210> 29 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 14 forward primer 
<400> 29 

ctttttggca gctttaaatg atagc 25 



<210> 30 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 14 reverse primer 

<400> 30 

aatctttctc ctttactagg aggta 25 



<210> 31 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 15 forward primer 

<400> 31 

tttacttctt ttgtattgga agcca 25 



<210> 32 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 15 reverse primer 

<400> 32 

tagaggataa atcgattgat aggga 25 



<210> 33 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 16 forward primer 

<400> 33 

atctgaaggg gtaattatta aaggc 25 



<210> 34 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 16 reverse primer 

<400> 34 

tgttccagaa atggtgcaag aattc 25 



<210> 35 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 1 sense primer 

<400> 35 

gtgcccactc aaaaggtt 18 



<210> 36 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 1 antisense primer 

<400> 36 

caagagtttt taccaaccca 20 



<210> 37 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 2 sense primer 

<400> 37 

atgtattgtc acctagtgtt tg 22 



<210> 38 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 2 antisense primer 

<400> 38 

gtggcccaat tatttcact 19 



<210> 39 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 3 sense primer 

<400> 39 

taagagttgg tttgtgcttg 20 



<210> 40 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 3 antisense primer 

<400> 40 

aacatggtca actgctacat 20 



<210> 41 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 4 sense primer 

<400> 41 

atgttttggg gctttattg 19 



<210> 42 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 4 antisense primer 
<400> 42 

tattccagat tctccctgc 19 



<210> 43 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 5 sense primer 

<400> 43 

caggctttgc agacatcta 19 



<210> 44 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 5 antisense primer 

<400> 44 

attgttatgg aaagcaacca 20 



<210> 45 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 6 sense primer 

<400> 45 

gaggtgcttt gtatcaggct 20 



<210> 46 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 6 antisense primer 

<400> 46 

caccctcatc acagacatc 19 



<210> 47 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 7 sense primer 

<400> 47 

ctgtcctaga atctgcattt 20 



<210> 48 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 7 antisense primer 

<400> 48 

agctggtgct acaaaaat 18 



<210> 49 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 8 sense primer 

<400> 49 

aaagggtaaa attacgtggg 20 



<210> 50 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 8 antisense primer 

<400> 50 

tctggttgtt gcttcctact 20 



<210> 51 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 9 sense primer 

<400> 51 

gttagggaag catccaaga 19 



<210> 52 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 9 antisense primer 

<400> 52 

agggaagctt tccaaaagta 20 



<210> 53 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 10 sense primer 

<400> 53 

tctccccaaa gcacagataa ct 22 



<210> 54 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 10 antisense primer 

<400> 54 

tggtggtgga tgtctgtagt 20 



<210> 55 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 11 sense primer 
<400> 55 

ctaattacct tccaaagggc 20 



<210> 56 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 11 antisense primer 

<400> 56 

gctcaggatt ttcttcccta 20 



<210> 57 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 12 sense primer 

<400> 57 

ctggactgag tgttcaggag 20 



<210> 58 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 12 antisense primer 

<400> 58 

agagagtgca aaatggacag 20 



<210> 59 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 13 sense primer 

<400> 59 

tgcctgtagc tcttcatctc 20 



<210> 60 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 13 antisense primer 

<400> 60 

ataagggcaa agaggaaagt 20 



<210> 61 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 14 sense primer 

<400> 61 

tttgttcttc ctttaaaacc g 21 



<210> 62 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 14 antisense primer 

<400> 62 

aatctttctc ctttactagg aggta 25 



<210> 63 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 15 sense primer 

<400> 63 

tttacttctt ttgtattgga agcca 25 



<210> 64 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 15 antisense primer 

<400> 64 

aaaaggccca aaacaataag 20 



<210> 65 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 16-1 sense primer 

<400> 65 

atctgaaggg gtaattatta aaggc 25 



<210> 66 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 16-1 antisense primer 

<400> 66 

caggagtttc cagaattcaa 20 



<210> 67 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 16-2 sense primer 

<400> 67 

tgttgttttc tgttcccttg 20 



<210> 68 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Exon 16-2 antisense primer 

<400> 68 

tgttccagaa atggtgcaag aattc 25 